• No results found

string similarity

Alignment Based Discriminative String Similarity

Alignment Based Discriminative String Similarity

... and string similarity might be learned jointly with a co-training or bootstrapping approach (Klementiev and Roth, ...discriminative string similarity with a more complex discriminative model ...

8

State of the art in string similarity search and join

State of the art in string similarity search and join

... crete similarity measure (edit distance, Jaccard, Ham- ming ...of string comparisons (global or local alignment, approximate substring search ...on string similarity search/join with edit ...

14

WORD SENSE DISAMBIGUATION USING FUZZY SEMANTIC-BASED STRING SIMILARITY MODEL

WORD SENSE DISAMBIGUATION USING FUZZY SEMANTIC-BASED STRING SIMILARITY MODEL

... Figure 2 represents the whole framework of WSD process by implementing the Fuzzy Semantic-Based String Similarity. The input sentence is labelled as ‘d’, where it needs to be pre-processed before it ...

8

Comparison of String Similarity Measures for Obscenity Filtering

Comparison of String Similarity Measures for Obscenity Filtering

... of string similarity measure in task of obscenity fil- tering is the time complexity of computing simi- larity ...different similarity measures using the following O – notation and the following ...

5

An evaluation of keyword, string similarity and very shallow syntactic matching for a university admissions processing infobot

An evaluation of keyword, string similarity and very shallow syntactic matching for a university admissions processing infobot

... Abstract. “Infobots” are small-scale natural language question answer- ing systems drawing inspiration from ELIZA-type systems. Their key dis- tinguishing feature is the extraction of meaning from users’ queries with- ...

24

Optimal Transport based Alignment of Learned Character Representations for String Similarity

Optimal Transport based Alignment of Learned Character Representations for String Similarity

... String similarity models are vital for record linkage, entity resolution, and ...the similarity of two ...each string, aligns the encodings using Sinkhorn It- eration (alignment is posed as an ...

11

Q-GRAM BASED JOIN FOR STRING SIMILARITY SEARCH

Q-GRAM BASED JOIN FOR STRING SIMILARITY SEARCH

... The string similarity join, which is employed to find similar string pairs from string sets, has received extensive attention in database and information retrieval ...the string ...

6

Learning to combine multiple string similarity metrics for effective toponym matching

Learning to combine multiple string similarity metrics for effective toponym matching

... cosine similarity between character n-grams), hybrid approaches ...of string similarity metrics, and that carefully tuning the similarity threshold is important for achieving good ...multiple ...

29

Harry: A Tool for Measuring String Similarity

Harry: A Tool for Measuring String Similarity

... their similarity is a basic operation in many application domains of machine learning, such as in information retrieval, natural language processing and ...available similarity measures for this task, each ...

5

Automatic Construction of Weighted String Similarity Measures

Automatic Construction of Weighted String Similarity Measures

... In this paper three approaches were introduced with the common goal of generating language dependent string matching functions automatically in order to improve the recognition of string[r] ...

7

Exploring Application Level Semantic for Data Compression

Exploring Application Level Semantic for Data Compression

... For string type and text type similarity, a dual variable length hidden Markov model is used and updated in this work for calculating similarity between text ...a string pair p(str1, str2), ...

5

Learning Phenotype Mapping for Integrating Large Genetic Data

Learning Phenotype Mapping for Integrating Large Genetic Data

... Accurate phenotype mapping will play an im- portant role in facilitating Phenome-Wide As- sociation Studies (PheWAS), and potentially in other phenomics based studies. The Phe- WAS approach investigates the association ...

9

A Comparative Study for String Metrics and the Feasibility of Joining them as Combined Text Similarity Measures

A Comparative Study for String Metrics and the Feasibility of Joining them as Combined Text Similarity Measures

... eight string similarity ...hybrid similarity metrics, called “fuzzy token matching based similarity,” which extends token-based similarity functions ...Jaccard similarity and ...

13

Automatically Constructing a Normalisation Dictionary for Microblogs

Automatically Constructing a Normalisation Dictionary for Microblogs

... In Step 1, we leverage large volumes of Twitter data to identify the most distributionally-similar IV type for each OOV type. The result of this pro- cess is a set of (OOV, IV) pairs, ranked by dis- tributional ...

12

Combining string and phonetic similarity matching to identify misspelt names of drugs in medical records written in Portuguese

Combining string and phonetic similarity matching to identify misspelt names of drugs in medical records written in Portuguese

... use string similarity methods to search for valid words within a text, coupled with a supporting ...joint string and language-dependent phonetic similarity is more accurate than traditional ...

7

Unsupervised Mining of Lexical Variants from Noisy Text

Unsupervised Mining of Lexical Variants from Noisy Text

... a string similarity measure, to form a highly effective hybrid noisy text normalization ...heuristic string similarity-based approach handled many of the less common test cases from the tail ...

9

An Unsupervised Method for Discovering Lexical Variations in Roman Urdu Informal Text

An Unsupervised Method for Discovering Lexical Variations in Roman Urdu Informal Text

... Figures 1 and 2 show results of selected ex- periments for Web and SMS datasets respectively. The x-axis shows the experiment (Exp.) IDs while the left y-axis gives the precision, recall, and f- measure and the right ...

6

Unsupervised Resolution of Objects and Relations on the Web

Unsupervised Resolution of Objects and Relations on the Web

... The particular choice of α and β make little differ- ence to our results, so long as they are chosen such that the resulting probability can never be one or zero. In our experiments α = 20 and β = 5, and we use the ...

10

Large Graph Database for Subgraph Matching with Set Similarity Using String Metric Algorithm

Large Graph Database for Subgraph Matching with Set Similarity Using String Metric Algorithm

... of similarity headquartered on connectivity alone, and proposes a few algorithms to quantify ...founded similarity estimation between two nodes are described, one headquartered on the separate neighborhood ...

5

Splitting of Compound Terms in non Prototypical Compounding Languages

Splitting of Compound Terms in non Prototypical Compounding Languages

... of string similarity may introduce some false positive splitting, but at the same time, it allows detection of additional components not cov- ered by the ...

9

Show all 5787 documents...

Related subjects