• No results found

document collections

Visualizing Structured Text: A Prototype Graphical Tool for Analyzing Large Document Collections

Visualizing Structured Text: A Prototype Graphical Tool for Analyzing Large Document Collections

... digital document collections to make them more manageable (Furnas 2003, ...each document separately, use of a set of graphical document surrogates or structural summaries seems certain to ...

74

Making Sense of Document Collections with Map-Based Visualizations

Making Sense of Document Collections with Map-Based Visualizations

... the work at the level of a single document (i.e., searching, browsing, navigating, and identifying). Such tasks are typical for text representations. However, the transition from text to graphical representations ...

237

How Does Exploration Impact IR Performance in   Large Document Collections?

How Does Exploration Impact IR Performance in Large Document Collections?

... Abstract A significant problem for IR researchers is how to efficiently handle large collections of electronic documents. Manual review is time consuming and expensive. Automated methods can be imprecise and fail ...

19

A new weighting scheme and discriminative approach for information retrieval in static and dynamic document collections

A new weighting scheme and discriminative approach for information retrieval in static and dynamic document collections

... inverse document frequency ...dynamic document collections. Also, we used the document centroid vector as a discriminative approach in this ...The document centroid vector is the ...

8

Dynamic and Static Topic Model for Analyzing Time Series Document Collections

Dynamic and Static Topic Model for Analyzing Time Series Document Collections

... Probabilistic topic models such as latent Dirichlet allocation (LDA) (Blei et al., 2003) have been uti- lized for analyzing a wide variety of datasets such as document collections, images, and genes. Al- ...

5

Discovering Diverse and Salient Threads in Document Collections

Discovering Diverse and Salient Threads in Document Collections

... BILL BY GOP SENATORS INCREASES BORDER GUARDS; NEW SECURITY IS PART OF AN OVERALL IMMIGRATION PLANBILL BY GOP SENATORS INCREASES BORDER GUARDS; NEW SECURITY IS PART OF AN OVERALL IMMIGRAT[r] ...

11

Finding Maximal Sequential Patterns in Text Document Collections and Single Documents

Finding Maximal Sequential Patterns in Text Document Collections and Single Documents

... Most of the algorithms for sequential pattern mining have been developed for vertical databases, this is, databases with short sequences but with a large amount of sequences. A document database can be considered ...

10

Combinatoric models of information retrieval ranking methods and performance measures for weakly-ordered document collections

Combinatoric models of information retrieval ranking methods and performance measures for weakly-ordered document collections

... test collections started in 1992 with the Text REtrieval Con- ference ...the collections were so large. With the first generation test collections, it was possible — if one had enough persever- ance ...

604

DCU TCD@LogCLEF 2010: Re ranking Document Collections and Query Performance Estimation

DCU TCD@LogCLEF 2010: Re ranking Document Collections and Query Performance Estimation

... of collections that were searched for each query, and the collections that the user clicked ...of collections displayed to the user with the aim of increasing the retrieval precision (across the ...

14

Ranked Bandits in Metric Spaces: Learning Diverse Rankings over Large Document Collections

Ranked Bandits in Metric Spaces: Learning Diverse Rankings over Large Document Collections

... given document is included in the context—that is, if this document is skipped by the current user—then similar documents are likely to be skipped, ...

38

Indexing and Searching Document Collections using Lucene

Indexing and Searching Document Collections using Lucene

... model to support Boolean and fuzzy searching, but it essentially remains a VSM based system at the heart. In Lucene scoring is very much dependent on the way documents are indexed; the objects that are scored are the ...

47

Scalable Term Selection for Text Categorization

Scalable Term Selection for Text Categorization

... Two document collections are used in this ...primary document collection because Chinese text (as well as other Asian languages) has a very large term set and a satisfying subset is usually not ...

9

Literature Survey on Tourism Recommendation

Literature Survey on Tourism Recommendation

... for document collections, the author- topic model, that concurrently models the content of documents and the interests of ...each document with a mixture of topics, as in state-of-the-art approaches ...

7

Olelo: A Question Answering Application for Biomedicine

Olelo: A Question Answering Application for Biomedicine

... Olelo provides solutions for the shortcomings listed above: (i) It detects both the question type and answer type. (ii) It includes various NLP com- ponents and outputs answers in real time (cf. Sec- tion 5). (iii) It ...

6

DiscoFuse: A Large Scale Dataset for Discourse Based Sentence Fusion

DiscoFuse: A Large Scale Dataset for Discourse Based Sentence Fusion

... two document collections: Wikipedia and Sports articles, yielding 60 million fusion ex- amples annotated with discourse information required to reconstruct the fused ...

13

An Empirical Evaluation of doc2vec with Practical Insights into Document Embedding Generation

An Empirical Evaluation of doc2vec with Practical Insights into Document Embedding Generation

... state-of-the-art document em- bedding approaches, and that doc2vec performs particularly strongly over longer ...inducing document embed- dings using our trained ...

9

CiteRep   Journal Citation Statistics for Library Collections using Document Reference Extraction Techniques

CiteRep Journal Citation Statistics for Library Collections using Document Reference Extraction Techniques

... Providing access to journals often comes with a considerable subscription fee for universities. It is not always clear how these journal subscriptions actually contribute to ongoing research. This thesis provides a ...

66

Multiple Collections

Multiple Collections

... By reinterpreting the original object in various ways I saw the original cut nail form as a unifying element in the body of work and a base element to explore the unification and transla[r] ...

15

Dynamic Link Inclusion in Online PDF Journals

Dynamic Link Inclusion in Online PDF Journals

... Two complementary de facto standards for the publication of electronic doc- uments are HTML on the World Wide Web and Adobe's PDF (Portable Docu- ment Format) language for use with Acrobat viewers. Both these formats ...

14

Show all 10000 documents...

Related subjects