• No results found

Machine Learning at DIKU

N/A
N/A
Protected

Academic year: 2021

Share "Machine Learning at DIKU"

Copied!
25
0
0

Loading.... (view fulltext now)

Full text

(1)

Faculty of Science

Machine Learning at DIKU

Christian Igel

(2)

Machine learning

Machine learning is a branch of computer science and applied statistics covering software that improves its performance at a

(3)

Why machine learning?

• Computer systems are required for tasks for which solutions

cannot be specified in the traditional way, e.g., because

• the designer’s knowledge is limited, and/or • the sheer complexity and variability precludes an

(4)

Why machine learning?

• Computer systems are required for tasks for which solutions

cannot be specified in the traditional way, e.g., because

• the designer’s knowledge is limited, and/or • the sheer complexity and variability precludes an

accurate description.

• However, large amounts of data describing the task are often

(5)

Why machine learning?

• Computer systems are required for tasks for which solutions

cannot be specified in the traditional way, e.g., because

• the designer’s knowledge is limited, and/or • the sheer complexity and variability precludes an

accurate description.

• However, large amounts of data describing the task are often

available or can be automatically obtained.

• To take proper advantage of this information, we need

systems that self-adapt and automatically improve based on sample data

(6)

Why machine learning?

• Computer systems are required for tasks for which solutions

cannot be specified in the traditional way, e.g., because

• the designer’s knowledge is limited, and/or • the sheer complexity and variability precludes an

accurate description.

• However, large amounts of data describing the task are often

available or can be automatically obtained.

• To take proper advantage of this information, we need

systems that self-adapt and automatically improve based on sample data – systems that learn.

(7)

Why machine learning?

• Computer systems are required for tasks for which solutions

cannot be specified in the traditional way, e.g., because

• the designer’s knowledge is limited, and/or • the sheer complexity and variability precludes an

accurate description.

• However, large amounts of data describing the task are often

available or can be automatically obtained.

• To take proper advantage of this information, we need

systems that self-adapt and automatically improve based on sample data – systems that learn.

(8)

Machine learning research at DIKU

We are concerned with the design and analysis of adaptive systems for pattern recognition (data mining, time series prediction), data modeling, and behaviour generation (decision making).

Our fields of expertise include

• state-of-the-art classification, regression, and density

estimation techniques,

• efficient and robust learning algorithms for large scale

problems, and

• computational intelligence methods for non-linear optimisation

including vector optimisation and multi-criteria decision making.

(9)

DIKU researchers in learning systems

Machine Learning Lab

http://image.diku.dk/MLLab

Image Group

http://www.diku.dk/forskning/Billedgruppen

DIKU faculty doing machine learning, information retrieval, and pattern recognition:Corinna Cortes (head of Google Research New York, adjunct), Marleen De Bruijne, Sune Darkner, Aasa Feragen, Christian Igel (head of ML Lab), Francois Lauze, Christina Lioma, Mads Nielsen (head of Image Group), Marco Loog (TU Delft, adjunct) Søren Olsen, Jon Sporring, Kim Steenstrup Pedersen, . . .

(10)

Important themes in our work

Autonomous learning

Technical systems shouldlearn robustly and autonomously, e.g., not requiring an expert to select

• learning algorithm and hyperparameters, • appropriate data representation, etc.

(11)

Important themes in our work

Autonomous learning

Technical systems shouldlearn robustly and autonomously, e.g., not requiring an expert to select

• learning algorithm and hyperparameters, • appropriate data representation, etc.

Scalability of adaptive systems

We need learning algorithms able to

• handle large amounts of data as well as to • generalise from few training examples.

(12)

Exemplary method:

(13)

Exemplary method:

(14)

Exemplary method:

Support Vector Machines (SVMs)

Φ Φ

(15)

Exemplary method:

Support Vector Machines (SVMs)

Φ Φ

(16)

Exemplary method:

Support Vector Machines (SVMs)

Φ Φ

(17)

Exemplary method:

Support Vector Machines (SVMs)

Φ Φ

(18)

Exemplary method:

Support Vector Machines (SVMs)

Φ Φ

(19)

Scaling up SVMs

SVMs give excellent results in practice and are well understood theoretically

(20)

Scaling up SVMs

SVMs give excellent results in practice and are well understood theoretically – but how to make them applicable to big data?

(21)

Scaling up SVMs

SVMs give excellent results in practice and are well understood theoretically – but how to make them applicable to big data? We address this issue by

1 new optimization algorithms,

Dogan, Glasmachers, Igel: Fast Training of Multi-class Support Vector Machines, submitted

2 new (e.g., cascaded) learning architectures,

Prasoon et al.: Cascaded classifier for large-scale data applied to automatic segmentation of articular cartilage. SPIE Medical Imaging, 2012

(22)

Example: Cartilage segmentation

10000 15000 20000 25000 30000 35000 40000 45000 50000 55000 60000 0 2 4 6 8 10 12 14 16 time number of cores

(23)

Business example: Credit scoring

Acredit scoremeasures the creditworthiness of a client.

Client applies for loan Application evaluation Loan evaluation Client granted loan Client declined loan Bad Good

(24)

Results from MSc thesis

L D A L O G K − N N RF C AR T C 4 .5 SVM Mo d . R F 0 .7 6 0 .7 8 0 .8 0 0 .8 2 0 .8 4 0 .8 6 0 .8 8 Accu ra cy 0.846 0.835 0.833

(25)

When theory and practice meet . . .

Markounikau, Igel, Jancke:PLoS Comp Biol6, 2010

Winter et al.:IEEE TEC12, 2008 Winter et al.:UMB 35, 2009 Roth, Igel, Handmann:IJCIA4, 2004

Mayr et al.:Analytical Chemistry75, 2003

References

Related documents

Objective: To maintain and expand specialized transitional residential services designed specifically for pregnant women and women with dependent children. Strategy: Services

In Massachusetts, trial courts have discretion to admit polygraph evidence, which is admissible only after a party can show the polygraph examiner is reliable through

Special cases and applications are described in a collection of related articles: estimator selection and performance assessment based on uncensored data (Dudoit and van der

Internet Enabled Users GSM Netwo rk GSM Modem Moderation Platform Audio Hosting File Sharing Users in Delhi Users in Rajasthan GSM Net wor k IVR Service Moderator Upload

Where you had the straight voluntary, you had sort of the first four questions voluntary, and then follow up to try to encourage people to participate even if in fact they were

The questions concerned risk factors and consequences of female urinary incontinence, its diagnosis and clinical assessment, urodynamic investigations,

Here, we use oyster shell oxygen isotope data to quantitatively compare modern and early 17th century James River estuarine conditions and to describe the season of early

33 The need for officers to check and care for men’s feet is one prominent example that Keegan gives (taken from Sassoon’s account of how this activity changed him as a