• No results found

large high dimensional datasets

Efficient Prediction Based Validation for Document Clustering

Efficient Prediction Based Validation for Document Clustering

... to large, high-dimensional datasets such as text ...a large number of real and artificial datasets, where it is shown to consistently produce good estimates for the optimal ...

17

Deterministic Annealing Dimension Reduction and Biology

Deterministic Annealing Dimension Reduction and Biology

... • Need is pervasive – Large and high dimensional data are everywhere: biology, physics, Internet, … – Visualization can help data analysis • Visualization of large datasets with high per[r] ...

32

A New Test for Large Dimensional Regression Coefficients

A New Test for Large Dimensional Regression Coefficients

... Some high dimensional data, such as gene expression datasets in microarray, exhibits the property that the number of covariates greatly exceeds the sample ...a large number of marginal ...

5

Efficient High Utility Top K Frequent Pattern Mining from  High Dimensional Datasets

Efficient High Utility Top K Frequent Pattern Mining from High Dimensional Datasets

... pattern tree data structures. It implements the parallel projection scheme to utilized disk storage. The algorithm CTU-Mine is proposed for mining HUI from the huge datasets. This algorithm first identifies the ...

7

TIMPs of parasitic helminths – a large scale analysis of high throughput sequence datasets

TIMPs of parasitic helminths – a large scale analysis of high throughput sequence datasets

... Secondary structure predictions and homology modelling Structure-based sequence alignments of TIMP proteins were computed and manually edited with SBAL [54] guided by secondary structure elements predicted using the ...

11

TIMPs of parasitic helminths: a large-scale analysis of high-throughput sequence datasets

TIMPs of parasitic helminths: a large-scale analysis of high-throughput sequence datasets

... Secondary structure predictions and homology modelling Structure-based sequence alignments of TIMP proteins were computed and manually edited with SBAL [54] guided by secondary structure elements predicted using the ...

11

High Performance Multidimensional Scaling for Large High Dimensional Data Visualization

High Performance Multidimensional Scaling for Large High Dimensional Data Visualization

... visualizing high-dimensional data is getting more difficult and challenged by the huge amount of the given ...such large and high-dimensional dataset, we have observed that such a task ...

14

A Non-Linear Chaotic Based PSO Feature Selection Approach For High Dimensional Data Classification

A Non-Linear Chaotic Based PSO Feature Selection Approach For High Dimensional Data Classification

... microarray high dimensional datasets is ...extreme large datasets without any ...microarray high dimensional datasets with nine state-of-the- art algorithms reveal ...

6

Deterministic Annealing and Robust Scalable Data mining for the Data Deluge

Deterministic Annealing and Robust Scalable Data mining for the Data Deluge

... Need is pervasive – Large and high dimensional data are everywhere: biology, physics, Internet, … – Visualization can help data analysis Visualization of large datasets with high perform[r] ...

33

Classification and Analysis of High Dimensional
          Datasets using Clustering and Decision tree

Classification and Analysis of High Dimensional Datasets using Clustering and Decision tree

... Classification algorithms discussed by Hanady Abdulsalam et al. [10], holds of three phases; a training phase that contains of labeled records, a test phase using earlier unseen labeled records, and a consumption or ...

5

High Performance Dimension Reduction and Visualization for Large High dimensional Data Analysis

High Performance Dimension Reduction and Visualization for Large High dimensional Data Analysis

... Abstract—Large high dimension datasets are of growing im- portance in many fields and it is important to be able to visualize them for understanding the results of data mining approaches or just for ...

10

ACCURATE AND EFFICIENT MINING FOR CONFIDENCE COLOSSAL PATTERNS FROM HIGH DIMENSIONAL DATASETS: CDFP-MINE

ACCURATE AND EFFICIENT MINING FOR CONFIDENCE COLOSSAL PATTERNS FROM HIGH DIMENSIONAL DATASETS: CDFP-MINE

... for high dimensional datasets, the pattern mining problem consumes greater time and ...handling high dimensional datasets due to its dimensionality, length and primary memory ...

15

Enhanced Manhattan-based Clustering using Fuzzy C-Means Algorithm for High Dimensional Datasets

Enhanced Manhattan-based Clustering using Fuzzy C-Means Algorithm for High Dimensional Datasets

... a high dimensional data includes a high computational cost, a high dimensional dataset composed of thousands of attribute and or ...requires high computational resources as it ...

6

Summarization techniques for visualization of large, multidimensional datasets

Summarization techniques for visualization of large, multidimensional datasets

... Summarization is performed using various techniques. These techniques are designed for the automated and unsupervised analysis and exploration of raw data, followed by the gen- eration of effective summaries based on the ...

19

MINING OF OUTLIER DETECTION IN LARGE CATEGORICAL DATASETS

MINING OF OUTLIER DETECTION IN LARGE CATEGORICAL DATASETS

... Outlier detection will typically be thought of as a pre-processing step for locating, throughout a data set, those objects that do not fits well-defined notions of expected behaviour. it is vital in process for locating ...

8

Adaptive Particle Swarm Optimization based Credentialed Extreme Learning Machine Classifier (APSO CELMC) for High Dimensional Datasets

Adaptive Particle Swarm Optimization based Credentialed Extreme Learning Machine Classifier (APSO CELMC) for High Dimensional Datasets

... Feature selection is the method of selecting a set of representative features / dimensions [1] which have high correlation with the output variables (forecasting variables). Researchers applied a variety of ...

7

Performance of lasso-penalized classifiers in high-dimensional datasets

Performance of lasso-penalized classifiers in high-dimensional datasets

... Statistical classification techniques are often required in clinical research, especially when developing diagnostic instruments. For example, we may wish to develop an instrument that can classify people into whether or ...

46

Hybrid Cloud and Cluster Computing Paradigms for Scalable Data Intensive Applications

Hybrid Cloud and Cluster Computing Paradigms for Scalable Data Intensive Applications

... Need is pervasive – Large and high dimensional data are everywhere: biology, physics, Internet, … – Visualization can help data analysis Visualization of large datasets with high perform[r] ...

62

Cloud Technologies and Their Applications

Cloud Technologies and Their Applications

... Need is pervasive – Large and high dimensional data are everywhere: biology, physics, Internet, … – Visualization can help data analysis Visualization of large datasets with high perform[r] ...

50

Implications of Clouds for Data Intensive Science with application to Biomedical Science

Implications of Clouds for Data Intensive Science with application to Biomedical Science

... Need is pervasive – Large and high dimensional data are everywhere: biology, physics, Internet, … – Visualization can help data analysis Visualization of large datasets with high perform[r] ...

35

Show all 10000 documents...

Related subjects