• No results found

Integrated Collaborative Information Systems

N/A
N/A
Protected

Academic year: 2020

Share "Integrated Collaborative Information Systems"

Copied!
32
0
0

Loading.... (view fulltext now)

Full text

(1)

Integrated Collaborative

Information Systems

Ahmet E. Topcu

at

[email protected]

(2)

Outline

p

Introduction

p

Motivation

p

Research Issues

p

Architecture

p

Measurements and Analysis

p

Conclusions

n

Contributions

(3)

Introduction

p

Efforts for collaboration and sharing between

users and communities in Web 2.0 domain

Web 2.0

p Represents new web-based services

p Provides rich and lightweight online tools

p Provides reusable services and data

p Updates software and data often very rapidly

p Provides interactive user interfaces

(4)

Web 2.0 Examples

p

Blogs (blogger.com, GoogleBlog)

p

Wikis(Wikipedia, WikiWikiWeb)

p

Social Networking Tools(MySpace ,LinkedIn)

p

Social Bookmarking Tools(del.icio.us ,YouTube)

p

Domain of scientific research (CiteULike , Connotea , and

Bibsonomy)

(5)

Motivation

p

Numerous annotation and search tools. Each of them

has different capability and not completed defined

metadata

p

Need for exploiting large set of data sources from

various tools

p

Integration of major annotation and search tools in

order to use them having additional functionalities for

scientific research

(6)

Motivation II

q

Necessities for integration

q

Need for common data format

q

No easy way to find all publications

q

Example: A search in Google Scholar for the

publications of our research lab (Community Grids

Lab) will return only about 20% of the total CGL

publications.

q

Wealth of information contained in numerous field

remains largely outside the scope of tools

(7)

Motivation Scenario :

Collection of

Information using Search Tools

p

The search tools have two main roles in the

usage scenarios of our system:

n

They will be used to

seed the creation of a community

(e.g., the papers of a research group, the papers on a

chemical compound, etc.).

n

These seeds will then be expanded and refined by our

(8)
(9)

Motivation Scenario II

: Collection of

Information using Search Tools

p

Extract information from Search Domain

n

Example: Using heuristic method for Google Scholar.

p

Extract information to build metadata having

search key

p

This model can be used for various search tools

n

Collect metadata for scholarly published papers.

(10)
(11)

Research Issues

p

Integration

n Building a model to

p integrate community tools and adding value to existing

systems

p natural collection of related documents easily p support more metadata

p support tagging

p

Scalability

n Investigate system behavior for increased message rate per

second

p

Flexibility and Extensibility

(12)

Architecture Principles

p Community-centric platform of services

p Integration of dynamic publication, search tools into

Cyberinfrastructure based scholarly research

p Integration such scientific research defining metadata and

using various URL, and map them

p Services that aggregate information from a variety of sources

(i.e., “mash-up” tools) and provide added value to communities of researchers

p Do not build a new tagging or search systems. Reuse the tools

and adding value to existing systems

p Easier to link together all relating information common Digital

(13)
(14)

Integrated Collaborative Information Systems (ICIS)

Architecture

p

Tools:

External web tools providing services to clients.

p

Clients:

Users to use the ICIS.

p

Gateways:

n Channels between tools and ICIS n Channels between clients and ICIS

p

Services

:

Collaborative environments for users to utilize

(15)

Integrated Collaborative Information Systems

(ICIS) Architecture Components

p

Tools

external web tools to provide services to clients

p

Integration Manager

have

information service and

provide communication between tools, client, and

responsible for integration operation in the system

p

Filter

operates two-way data filtering

p

Permission Handler

checks existing Digital Entity

(DE)s permission or build a new permission token for

new DEs

p

Data Manager

provides a mechanism to extract data

from a repository and insert data into a repository

(16)
(17)
(18)
(19)

Summary: Architecture

p

Build integration architecture

p

We do not reinvent existing tools

p

Use existing features of tools

p

Supports tagging services

p

Provides common metadata

p

Allows to use consistent data

(20)

Use Case:

Collection of Metadata from web pages

p

Collect

n Digital Entities in web pages using HTTP methods. p

Analyze

n Using heuristic methodology to extract metadata fields of the

Digital Entities for publications

p

Build

n RSS objects using collected Digital Entities. n New tags using collected Digital Entities.

p

Compare

n Collected Digital Entities from web pages with the existing

Digital Entities in ICIS repository.

p If they are:

§ different: Store new Digital Entities in ICIS repository.

§ same: Option to update tags and other fields for collected DEs

p

Share

(21)

Security Model

p

Security in web 2.0 can be limited.

p

We implemented a simple but more powerful security

model around local tools that wrap Web 2.0 systems.

p

We used an

access-control matrix

model to provide

security for our information system

n Supports multiple groups and multiple users for each Digital

Entity (DE).

n Similar to UNIX file system

p The Unix RWX bits corresponds to Read, Write, and Execute

operation for each file and directory.

n In our system, DE correspond to the file element and folder

corresponds to the directory element.

n For each DE and folder, there are three types of access rights

(22)

Security Model II

p

We have a security model that supports

n

Level of Authorization

p Roles are defined as Super Administrator (SA) and Group

Administrator (GA), User

p The system allows having more than one SA. p An existing SA can add other SAs to the system.

p SA can assign any User to become GA, and remove GA

from being group administrator.

p Each group should at least one GA. GA add/remove Users

from the group.

p Users can allow other Users and groups to share their

resources.

(23)

Security Model III

(24)

Benchmarks and Environments

p

Message rate scalability investigation

n

Search operation

v

Using Database Access

v

Using Memory Utilization

n

Test environments

v

Apache Axis version 1.2

v

Apache Tomcat Server version 5.0.28

v

Java 2 Runtime Environment, Standard Edition

(build 1.5.0_12-b04)

v

The maximum heap size of Java Virtual Machine(JVM)

is 1024 MB

(25)

Integrated Collaboration Information

System(ICIS) Framework

p

Search local repository using database access

(26)
(27)

Integrated Collaboration Information

System(ICIS) Framework II

p

Search local repository using memory with

(28)

Message rate scalability result

(29)

Contribution

p

System Research

n

Providing a architecture and model for

integration of collaborative systems

n

Integration and interoperability of annotation,

search tools, and web search tools

n

User collaboration and sharing resources.

n

Providing benchmarks to evaluate the

(30)

Contribution II

p

System Research

n Increasing performance and scalability using memory

utilization

n Providing flexibility allowing integration of different tools

having common metadata.

n Easy to add and extend service mechanism

n Supporting authorization and event based mechanism n Implementing a rather more powerful access control

mechanism

p

System Software

(31)

Future Works

p

Apply Integrated Collaboration Information

System(ICIS) Framework to other application

domains such as streaming collaboration systems

p

Integrate other collaboration and search tools

into ICIS Framework

n

CiteSeer

X

p

Use distributed storages instead of a single

storage

p

Expand our approaches to open-access scientific

(32)

References

Related documents

Number of employees 1 A Laboratory equipment Vice President of Supply Chain Small and Medium Enterprise (SME) 300 2 B Electromechanical medical technology Vice

In our opinion, the accompanying statement of assets and liabilities, including the statement of investments, and the related statements of operations and of changes in net assets

If the search were to be performed on one single node alone (ie without using Hyrax), not only would an index of available music files from other nodes need to be downloaded from

Second, we formulate the data transfer assignment problem as a novel linear programming model that determines the optimal logical paths consisting of a set of offloading spots

consumers which are informed by both firms, the captive segment of firm j is larger than that of firm i so that firm j is the less aggressive price-competitor and its expected

Related to Business Analytics, Data Analysis, Data Mining, Data Science, Machine Learning, Statistics.. Consumers generate

• medication can be linked to higher risk of gestational diabetes and, therefore, a blood sugar test must be performed at a maternity clinic for all HiV positive mothers who are 24

We have described HIT (and HIT m for multiple clusters), a hybrid clustering and indirect transmis- sion scheme for micro sensor networks. The novel feature of parallel,