• No results found

Wikipedia Vandalism Data Sets for Research

Cross-language learning from bots and users to detect vandalism on Wikipedia

Cross-language learning from bots and users to detect vandalism on Wikipedia

... way Wikipedia identifies and bans vandals, but their contributions are often not considered nor ...of vandalism across five languages to learn and compare the contributions of bots and users in the task of ...

14

Beyond vandalism: Wikipedia trolls

Beyond vandalism: Wikipedia trolls

... Prior research about hackers’ motivations used, for example, the Flow Theory, which explicates intrinsic motivations with clear goals and adequate levels of challenge, to elucidate hackers’ motiva‐ tions ...

13

Harnessing Context for Vandalism Detection in Wikipedia

Harnessing Context for Vandalism Detection in Wikipedia

... as sets of respective keywords and ...the Wikipedia page (represented as D), then the keywords of E and D should occur together in a non-negligible fraction of WWW ...

14

Detecting vandalism on Wikipedia across multiple languages

Detecting vandalism on Wikipedia across multiple languages

... revisions data set to label the article views data ...views data means that we have incomplete evidence of vandalism for each hour, so real-time decisions of vandalism cases may be ...

179

Large SMT data sets extracted from Wikipedia

Large SMT data sets extracted from Wikipedia

... evaluation sets were selected. Yet, due to the nature of the Wikipedia article authoring, even this strategy of filtering the development and evaluation would not ensure an unbiased ...The Wikipedia ...

8

Automatic Vandalism Detection in Wikipedia: Towards a Machine Learning Approach

Automatic Vandalism Detection in Wikipedia: Towards a Machine Learning Approach

... regard vandalism as the positive class through- out this paper, then there will be both false positives and false ...fighting vandalism like undoing changes without proper references or prior ...when ...

6

Wikipedia Vandalism Detection: Combining Natural Language, Metadata, and Reputation Features

Wikipedia Vandalism Detection: Combining Natural Language, Metadata, and Reputation Features

... of Wikipedia vandalism that constitutes, at the time of writing, the best-performing published ...English Wikipedia). Since the systems are based largely on non-overlapping sets of features, ...

14

Wikipedia vandalism detection: combining natural language, metadata, and reputation features

Wikipedia vandalism detection: combining natural language, metadata, and reputation features

... of Wikipedia vandalism that constitutes, at the time of writing, the best-performing published ...English Wikipedia). Since the systems are based largely on non-overlapping sets of features, ...

13

Detecting Wikipedia Vandalism via Spatio-Temporal Analysis of Revision Metadata

Detecting Wikipedia Vandalism via Spatio-Temporal Analysis of Revision Metadata

... Figure 6: Classifier Precision/Recall/Accuracy Fig. 6, where both recall and precision are plotted for vari- able cost parameters. No matter the parameterization, our classifier is capable of steady-state performance ...

8

Detecting Wikipedia Vandalism via Spatio-Temporal Analysis of Revision Metadata

Detecting Wikipedia Vandalism via Spatio-Temporal Analysis of Revision Metadata

... Of course, the recall-to-precision ratio is tune-able. If one wants to find more vandalism (cumulatively), one must tol- erate more false-positives. This relationship is visualized in Fig. 6, where both recall and ...

9

The Class Imbalance Problem in the Machine Learning Based Detection of Vandalism in Wikipedia across Languages

The Class Imbalance Problem in the Machine Learning Based Detection of Vandalism in Wikipedia across Languages

... distribution can be adjusted towards a uniform distribution. Since classifiers, after oversampling, are trained by using some minority class values multiple times, the learned model is likely to over fit. The Synthetic ...

7

Detection of Vandalism in Wikipedia using Metadata Features – Implementation in Simple English and Albanian sections

Detection of Vandalism in Wikipedia using Metadata Features – Implementation in Simple English and Albanian sections

... a vandalism detection system on top of their WikiTrust reputation system ...facto data as features, for whose calculation also future revisions have to be ...of vandalism required building a separate ...

7

Vandalism Detection in Wikipedia: a Bag-of-Words Classifier Approach. November 11, Amit Belani

Vandalism Detection in Wikipedia: a Bag-of-Words Classifier Approach. November 11, Amit Belani

... English Wikipedia vandalism prediction ...the Wikipedia corpus, and a failure to capture their predictive potential may lead to suboptimal ...the data is unknown, and is a topic for future ...

15

Is macroeconomic research robust to alternative data sets?

Is macroeconomic research robust to alternative data sets?

... available data at the time this article was first ...the data, including new source data (information-based revisions) and definitional changes (structural ...

44

“Got You!”: Automatic Vandalism Detection in Wikipedia with Web based Shallow Syntactic Semantic Modeling

“Got You!”: Automatic Vandalism Detection in Wikipedia with Web based Shallow Syntactic Semantic Modeling

... the Wikipedia vandalism detection task, since we do not have a sufficient amount of training data to model the topic of each edit, we propose the idea of using the Web as corpus by retrieving search ...

9

A Review on Vandalism Detection in OpenStreetMap Data and Emerging Trends

A Review on Vandalism Detection in OpenStreetMap Data and Emerging Trends

... on vandalism, existing OSM information and successes following Wikipedia vandalism detection ...of vandalism as you will find few ...sounding vandalism or perhaps not will also be ...

7

Large-scale DNS and DNSSEC data sets for network security research

Large-scale DNS and DNSSEC data sets for network security research

... the data sets obtained dur- ing a study [1] we carried out to establish this ground ...these data sets available as open data under a permissive Creative Com- mons ...these data ...

10

Large-scale DNS and DNSSEC data sets for network security research

Large-scale DNS and DNSSEC data sets for network security research

... the data sets obtained dur- ing a study [1] we carried out to establish this ground ...these data sets available as open data under a permissive Creative Com- mons ...these data ...

10

The value of publicly available data sets for social science  research and evaluation

The value of publicly available data sets for social science research and evaluation

... this research, namely the inability to identify a time priority for violence ...rubric. Research in this tradition has focused mainly on housing, transportation and neighborhood characteristics, emphasizing ...

7

Research Article Piecewise Bivariate Hermite Interpolations for Large Sets of Scattered Data

Research Article Piecewise Bivariate Hermite Interpolations for Large Sets of Scattered Data

... scattered data are high accuracy and high ...scatted data, so this paper also gives the approximate derivative at every scatted point using local radial basis function ...

10

Show all 10000 documents...

Related subjects