• No results found

Teamproject & Project

N/A
N/A
Protected

Academic year: 2021

Share "Teamproject & Project"

Copied!
24
0
0

Loading.... (view fulltext now)

Full text

(1)

Teamproject & Project

Organizer: Prof. Dr. Georg Lausen Advisors: Alexander Schätzle,

Martin Przjyaciel-Zablocki, Thomas Hornung

(2)

Study regulations

◦ Master: 16 ECTS

 480 hours ~ 34h/week p.P.

◦ Bachelor: 6 ECTS

 180 hours ~ 13h/week p.P.

◦ Team size: 3-4 students

◦ Project report

◦ Final presentation

◦ Workload of every student must be clearly distinguishable

Cloud Computing 2

(3)

Time & Place:

◦ Monday 14-17 pm (c.t.) ◦ Room: SR 00 007 (MMR), Building 106 

Next Meeting:

◦ Monday, 8. November 2010 14-17 pm (c.t.) ◦ Room: SR 00 007 (MMR), Building 106 

Further Schedule:

◦ Meeting with short presentations of all groups

◦ Further individual meetings upon consultation

(4)

 Induction phase

◦ Until Tuesday, 2. November 2010 Today!

◦ Project placing + Classification of groups

 Short presentation ◦ 8. November 2010 ◦ Project introduction ◦ Milestones ◦ Internal work-sharing  Implementation phase

◦ Programming & Documentation

◦ 10. / 17. January 2011: Status report of the Milestones (Meeting or Presentation)

 Final presentation

◦ 7. February 2011

◦ Contribution of project report (14. February 2011)

4 Cloud Computing

Schedule

(5)
(6)

Facebook (2010)

1

◦ > 500 Million active users

saving profiles, pictures, comments, news

◦ > 900 Million sites, groups, events, …

◦ Usage: > 700 Billion minutes per month

How can we handle and analyze such huge

amounts of Data?

Solution: Distributed-Computing?

6 Bachelor Project Source:

(1) Facebook Press Room (22.09.2010)

http://www.facebook.com/press/info.php?statistics

(7)

Analysis of social Networks

Query Language with navigational capabilities

Friend of a Friend (FOAF) Queries

Search for shortest paths

Means of expression

Startnode (e.g. „Chris“)

Specification of edges (Location steps)

(e.g. „knows“)

Filter (e.g. age = 18, gender = female)

Shortest Paths

(8)

Data basis

Graph of a social Network (Last.fm)

Friendship relationships

Properties of Users and Tracks

Analyzing interesting characteristics

„Six-Degrees of Separation “

Representation: RDF-like Triples (no explicit URIs)

Last.fm

Musikdienst mit sozialem Netzwerk

Freundschaften, Musiktitel, Hör-Profile, u.v.m.!

8 Cloud Computing

Bachelor Project

(9)

Peter knows Simon Peter knows Frank Peter country DE Peter age 25 Frank knows Chris Frank age 8 ... Frank DE Simon Peter Klaus CH Chris 42 60 8 25 knows country age

(10)

Peter :: knows.

Ergebnis

◦ Peter (knows) Frank

◦ Peter (knows) Simon

◦ Peter (knows) Klaus

10 knows country age Cloud Computing Frank DE Simon Peter Klaus CH Chris 42 60 8 25 Bachelor Project

(11)

Peter :: knows

> knows > age.

Zwischenergebnisse

◦ Peter (knows) Frank

◦ Peter (knows) Klaus

◦ Peter (knows) Simon

Frank DE Simon Peter Klaus CH Chris 42 60 8 25 knows country age

(12)

Peter :: knows > knows > age.

Ergebnisse

◦ Peter (knows) Frank (knows) Chris (age) 60

◦ Peter (knows) Klaus (knows) Simon (age) 42

◦ Peter (knows) Simon

12 Frank DE Simon Peter Klaus CH Chris 42 60 8 25 knows country age Cloud Computing Bachelor Project

(13)

Peter :: knows

> country [equals(DE)].

Zwischenergebnisse

◦ Peter (knows) Frank

◦ Peter (knows) Klaus

◦ Peter (knows) Simon

knows country age Frank DE Simon Peter Klaus CH Chris 42 60 8 25

(14)

Peter :: knows > country [equals(DE)].

Ergebnis

◦ Peter (knows) Frank (country) DE

◦ Peter (knows) Klaus (country) CH

◦ Peter (knows) Simon (country) CH

14 Frank DE Simon Peter Klaus CH Chris 42 60 8 25 knows country age Cloud Computing Bachelor Project

(15)

Peter :: knows(

*

3).

Ergebnisse

◦ Peter (knows) Frank

◦ Peter (knows) Frank (knows) Chris

◦ Peter (knows) Klaus

◦ Peter (knows) Simon

Frank DE Simon Peter Klaus CH Chris 42 60 8 25 knows country age

(16)

16 Bachelor Project User Track Artist Album Gender

Realname Country Age Duration

Playcount

(17)

 User ◦ knows: 3814884 ◦ topArtists: 13101056 ◦ topTracks: 13264340 ◦ topAlbums: 13130619 ◦ listenedTo: 49830975 ◦ country: 180056 ◦ playcount: 269247 ◦ realname: 144040 ◦ gender: 268310 ◦ age: 153366  Album ◦ artist: 269900 ◦ tracks: 181624 ◦ playcount: 240347  Track ◦ artist: 271237 ◦ album: 181624 ◦ topFans: 8970531 ◦ duration: 271205 ◦ playcount: 271205  Artist ◦ tracks: 271234 ◦ album: 269898 ◦ topFans: 7439313 ◦ topTracks: 7728443 ◦ topAlbums: 936749 ◦ similar: 59471056

(18)

Evaluation

MapReduce-Framework HDFS

Query Engine

Parser/Interpreter Sequence of MapReduce Jobs Query

Handling Large RDF Graphs with

MapReduce 18

(19)

(1)

Distributed Analysis of Social Networks

◦ Parsing of a Path Query Language

◦ Translating a Query in a Sequence of MapReduce-Jobs

◦ Storing (intermediate) results in the Cluster

◦ Execution of MapReduce-Jobs with Hadoop

(2)

Means of Expression

◦ Startnode

◦ Multiple Location Steps

◦ Filter

◦ Shortest Paths

(20)

Master

20

(Master) Team Project

(21)

Study regulations

◦ Master: 16 ECTS

 480 hours ~ 34h/week p.P.

◦ Recommendation: no parallel lectures!

◦ Team size: 3-4 students

◦ Project report

(22)

Self-activated Investigation and Induction in the

needed topics:

◦ Triple Stores

◦ SPARQL

◦ Resource Description Framework (RDF)

◦ MapReduce

◦ Hadoop Distributed Filesystem

◦ HBase

 Workload of every student must be clearly distinguishable

Cloud Computing 22

(23)

Goal

◦ Design and Implemenation of a distributed RDF Triple Store built on top of Hadoop (MapReduce-Framework)

General Conditions

◦ SPARQL as Query Language

(at least Basic Graph Patterns + Filter)

◦ Execution in the MapReduce-Framework (Hadoop)

◦ Storage strategy using HDFS or HBase

(24)

 Now

◦ Group assignment: 3-4 students per Team

◦ Exchange contact informations (E-Mail, phone)

◦ Plan your next team internal meeting (as soon as possible)

 Until next meeting (8. November)

◦ Get to know the Project Task

◦ Investigation and conceptual Design

◦ Determine Milestones and Schedule (Recommendation: ~ 3 Milestones)

◦ Plan internal work-sharing (regarding individual skills)

 8. November

◦ Short presentation of all groups (5-10 minutes)

◦ Content: Project introduction, Milestones and internal work-sharing (perhaps overview of the planned architecture)

 Next Meeting

◦ Monday, 8. November 2010 14-17 pm (c.t.)

References

Related documents

It signals how the organized opposition against a fast transition went more public in March 2014, when the CEOs of the major energy companies in Europe proposed to change the course

Delta Modeling Languages: µ TVL, DML, CL, PSL Component Model Deployment Components: Real-Time ABS Core AST Name Resolution Resolved AST Type Checker Type-Checked AST ABS IDE. Java

communities and an analysis on cost and benefit of PCD mechanism was done in three pairs. punctulatus, an inland vector became very rare with interventions),

Designed to help users and installers save time and money while getting more productivity from satellite airtime for ship-to-shore and remote communications, the

As expected, and except in one isolated case (i.e., using Index as explanatory variable for the one-month forecast of the sign of the Repo change 11 ), the market-based measures

3 의 조사 국가 간 미국 대비 가격차이율(UNIQLO Dollar Variation) 은 최대 40% 정도로, UNIQLO의 글로벌 가격은 소 득의 높고 낮음에 대한 영향보다는 브랜드의

(Lin et al.'s ( 2006 ) comparison of licences also notes whether documentation of modifications made is stipulated.) 'Distribute?'—does the licence explicitly permit you to

Second statement: Title to a tax –exempt donation of a real property cannot be transferred to the donee in the Register of Deeds unless the donor’s tax on the donation had