• No results found

Industrial Challenges for Content-Based Image Retrieval

N/A
N/A
Protected

Academic year: 2021

Share "Industrial Challenges for Content-Based Image Retrieval"

Copied!
27
0
0

Loading.... (view fulltext now)

Full text

(1)

Title Slide

Chahab Nastar, CEO

Vienna, 20 September 2005

Industrial Challenges for

(2)

Agenda

ƒ

CBIR – what is it good for?

ƒ

Technological challenges of CBIR

ƒ

Evaluating CBIR

ƒ

About LTU technologies

ƒ

Q&A

(3)

CBIR... what is it good for?

ƒ

Historically, (generic) CBIR is a cool technology...

- convergence of information retrieval and computer vision - « user in the loop », link to web search engines

- « query by example » approach

- feature extraction, fast search, relevance feedback Æ a lot of academic papers

ƒ

... in search of a useful application in the « real » world

- worst-case scenario for entrepreneurship: technology preceding market traction

(4)
(5)
(6)

CBIR’s redemption

ƒ

a powerful paradigm for object/scene recognition

ƒ

finding a series of images of the same scene/object

ƒ

user is in the loop

(7)

CBIR in the « real » world

Known world (reference)

B

Unknown world (query)

clone images duplicate images

similar images

(8)

User scenarios

ƒ

Verification:

1:1

- unrelated to information retrieval

ƒ

Identification:

1:N

- target search

- applications: stolen art, counterfeiting, media monitoring

ƒ

Investigation:

q:N

- find common ground between images

- applications: child exploitation, IT forensics, intelligence

ƒ

Classification:

A K

(9)

LTU’s technology – generalized CBIR

Input

Any digital image format, e-mail attachement, video frame,… Output Similarity Retrieval Clone Detection

Categorization face 100%

indoor 80%

Image Database / Knowledge Base

Visual Feature Extraction

color, shape, texture, arrangements, moments,…

Real-time Analysis

Image “DNA”

1

(10)

Technological challenges of CBIR

Metric Problem Application Visual/semantic distance Similarity:

A ~ B ? Visual retrieval

Matching distance Matching:

A = B ? « Clone » detection

Hamming distance Classification:

A K ? Image annotation

(11)

Technological challenges of CBIR

Metric Problem Application Visual/semantic distance Similarity:

A ~ B ? Visual retrieval

Matching distance Matching:

A = B ? « Clone » detection

Hamming distance Classification:

A K ? Image annotation

(12)

Clone Detection

Image clones same visual content, different graphical rendering (typically Photoshop manipulation)

cropped

reencoded

original old B&W negatized bad scan

(13)

Technological challenges of CBIR

Metric Problem Application Visual/semantic distance Similarity:

A ~ B ? Visual retrieval

Matching distance Matching:

A = B ? « Clone » detection

Hamming distance Classification:

A K ? Image annotation

(14)
(15)

Technological challenges of CBIR

Metric Problem Application Visual/semantic distance Similarity:

A ~ B ? Visual retrieval

Matching distance Matching:

A = B ? « Clone » detection

Hamming distance Classification:

A K ? Image annotation

(16)

Annotation by supervised learning

Classifier bank A A B B C C D D 1

1 00 11 11 00 11

Fusion module

Keyword Keyword

Finite

Finite vocabularyvocabulary

(17)
(18)

Evaluation of CBIR depends upon

ƒ

Image Content

- Scenes/objects represented - Image Quality

- Domain-specific knowledge

ƒ

User scenario

- identification

ƒ benchmarking database? - investigation

ƒ using a series of images - classification

ƒ

Usage scenario

- Search

ƒ Acceptable max rank of « retrieved images »? ƒ precision or recall?

(19)

Evaluation within a customer project

ƒ

Large database with Ground Truth

- provided by customer or

- put together by LTU

ƒ

User scenario & usage scenario taken into account

- in most mission-critical scenarios, maximizing recall is the main goal

(20)

The LTUtech dataset

ƒ

LTU has collected (mostly over the web) more than 80,000

images of 267 visual categories

ƒ

A visual category is a set of examples of an object

- animals, CD covers, gloves, watches etc.

ƒ

An average of 300 images per visual category

ƒ

The total database size is 1.3 GB

ƒ

The images vary in quality, size, format

ƒ

We do not known the copyright of the images

ƒ

The database is shared with the WP3 members of the

MUSCLE network

(21)

Golden Award for Technology Golden Award for Technology

The world’s top 100 innovators

MIT Technology Review

The world’s top 100 innovators

MIT Technology Review

European Information Society

Technologies Prize

European Information Society

Technologies Prize Europe’s Top 100 Companies

The Red Herring

Europe’s Top 100 Companies

The Red Herring

About LTU Technologies

ƒ Founded: 1999 from our group’s research at

INRIA (imedia research group)

ƒ Mission: Bringing image/video indexing/retrieval to the market

ƒ Headquartered: Paris & Washington D.C.

(22)

Value Prop: Law Enforcement & Intelligence

Providing tools to automatically “mine” information from large, unknown sets of images Data

Collection

Seized HD

Internet

P2P

Database of images w/ case information Database of suspect

Images

Content Management

(23)

Key-Clients (1)

Government, Police, International agencies

ƒ

Investigations

Child protection, Stolen art

ƒ

Computer forensics

Counterfeiting, ID theft

ƒ

Intelligence and monitoring

(24)

Value Prop: Media & Intellectual Property

Providing efficient access to images and tools for monitoring their usage

1 Search

2 Monitoring

3 Keywording

Manage Distribute

E-Mail Internet

(25)

Key-Clients (2)

Media and IP protection

ƒ

Image bank management

e-commerce, media

ƒ

Patent and design protection

Search, classification of designs

ƒ

Online IP protection

Logos, visual assets

Partners

ƒ

DAM software, text search engines

(26)

Summary

ƒ

Challenges in bringing CBIR to the market

- mission-critical applications

ƒ

Evaluation is case-by-case

- domain-specific

(27)

Questions & Answers

Chahab Nastar CEO

Paris, France

cn AT LTUtech.com

Chahab Nastar CEO

Paris, France

References

Related documents

Objective To determine whether ascertainment of childhood obesity by surveillance followed by structured secondary prevention in primary care improved outcomes in overweight or

Semantic connection set based massive RDF data query processing in Spark environment Xu and Zhang et al EURASIP Journal on Wireless Communications and Networking (2019) 2019 256 https

We believe that the ways in which the word, play, are used in the program documents across decades provide a sense of cultural values and understandings of early childhood, of

The 2015 Major League Baseball All-Star Game was played on July 14, 2015 at Great American Ball Park, the home ballpark of the Cincinnati Reds.. Great American Ball Park’s name

[4] S.P.Rajaram et al., “Optimal Placement of Distributed Generation for Voltage Stability Improvement and Loss Reduction in Distribution Network”

It includes the opportunities and threats of the Czech Republic and strengths and weaknesses of businesses, which managers of small and medium – sized enterprises identified in 2001

We therefore investigate the specific meteorological conditions (near-surface temper- ature inversion, relative humidity, surface temperature, wind speed and direction, in- and

and linguocultural features of proverbs in two languages: English and Uzbek,.. which do not belong to one language