Outline
p
Introduction
p
Motivation
p
Research Issues
p
Architecture
p
Measurements and Analysis
p
Conclusions
n
Contributions
Introduction
p
Efforts for collaboration and sharing between
users and communities in Web 2.0 domain
Web 2.0
p Represents new web-based services
p Provides rich and lightweight online tools
p Provides reusable services and data
p Updates software and data often very rapidly
p Provides interactive user interfaces
Web 2.0 Examples
pBlogs (blogger.com, GoogleBlog)
p
Wikis(Wikipedia, WikiWikiWeb)
p
Social Networking Tools(MySpace ,LinkedIn)
p
Social Bookmarking Tools(del.icio.us ,YouTube)
p
Domain of scientific research (CiteULike , Connotea , and
Bibsonomy)
Motivation
p
Numerous annotation and search tools. Each of them
has different capability and not completed defined
metadata
p
Need for exploiting large set of data sources from
various tools
p
Integration of major annotation and search tools in
order to use them having additional functionalities for
scientific research
Motivation II
q
Necessities for integration
q
Need for common data format
q
No easy way to find all publications
q
Example: A search in Google Scholar for the
publications of our research lab (Community Grids
Lab) will return only about 20% of the total CGL
publications.
q
Wealth of information contained in numerous field
remains largely outside the scope of tools
Motivation Scenario :
Collection of
Information using Search Tools
p
The search tools have two main roles in the
usage scenarios of our system:
n
They will be used to
seed the creation of a community
(e.g., the papers of a research group, the papers on a
chemical compound, etc.).
n
These seeds will then be expanded and refined by our
Motivation Scenario II
: Collection of
Information using Search Tools
p
Extract information from Search Domain
n
Example: Using heuristic method for Google Scholar.
p
Extract information to build metadata having
search key
p
This model can be used for various search tools
n
Collect metadata for scholarly published papers.
Research Issues
p
Integration
n Building a model to
p integrate community tools and adding value to existing
systems
p natural collection of related documents easily p support more metadata
p support tagging
p
Scalability
n Investigate system behavior for increased message rate per
second
p
Flexibility and Extensibility
Architecture Principles
p Community-centric platform of services
p Integration of dynamic publication, search tools into
Cyberinfrastructure based scholarly research
p Integration such scientific research defining metadata and
using various URL, and map them
p Services that aggregate information from a variety of sources
(i.e., “mash-up” tools) and provide added value to communities of researchers
p Do not build a new tagging or search systems. Reuse the tools
and adding value to existing systems
p Easier to link together all relating information common Digital
Integrated Collaborative Information Systems (ICIS)
Architecture
p
Tools:
External web tools providing services to clients.
pClients:
Users to use the ICIS.
p
Gateways:
n Channels between tools and ICIS n Channels between clients and ICIS
p
Services
:
Collaborative environments for users to utilize
Integrated Collaborative Information Systems
(ICIS) Architecture Components
p
Tools
external web tools to provide services to clients
pIntegration Manager
have
information service and
provide communication between tools, client, and
responsible for integration operation in the system
pFilter
operates two-way data filtering
p
Permission Handler
checks existing Digital Entity
(DE)s permission or build a new permission token for
new DEs
p
Data Manager
provides a mechanism to extract data
from a repository and insert data into a repository
Summary: Architecture
p
Build integration architecture
p
We do not reinvent existing tools
p
Use existing features of tools
p
Supports tagging services
p
Provides common metadata
p
Allows to use consistent data
Use Case:
Collection of Metadata from web pages
p
Collect
n Digital Entities in web pages using HTTP methods. p
Analyze
n Using heuristic methodology to extract metadata fields of the
Digital Entities for publications
p
Build
n RSS objects using collected Digital Entities. n New tags using collected Digital Entities.
p
Compare
n Collected Digital Entities from web pages with the existing
Digital Entities in ICIS repository.
p If they are:
§ different: Store new Digital Entities in ICIS repository.
§ same: Option to update tags and other fields for collected DEs
p
Share
Security Model
p
Security in web 2.0 can be limited.
p
We implemented a simple but more powerful security
model around local tools that wrap Web 2.0 systems.
p
We used an
access-control matrix
model to provide
security for our information system
n Supports multiple groups and multiple users for each Digital
Entity (DE).
n Similar to UNIX file system
p The Unix RWX bits corresponds to Read, Write, and Execute
operation for each file and directory.
n In our system, DE correspond to the file element and folder
corresponds to the directory element.
n For each DE and folder, there are three types of access rights
Security Model II
p
We have a security model that supports
n
Level of Authorization
p Roles are defined as Super Administrator (SA) and Group
Administrator (GA), User
p The system allows having more than one SA. p An existing SA can add other SAs to the system.
p SA can assign any User to become GA, and remove GA
from being group administrator.
p Each group should at least one GA. GA add/remove Users
from the group.
p Users can allow other Users and groups to share their
resources.
Security Model III
Benchmarks and Environments
p
Message rate scalability investigation
n
Search operation
v
Using Database Access
v
Using Memory Utilization
n
Test environments
v
Apache Axis version 1.2
v
Apache Tomcat Server version 5.0.28
v
Java 2 Runtime Environment, Standard Edition
(build 1.5.0_12-b04)
v
The maximum heap size of Java Virtual Machine(JVM)
is 1024 MB
Integrated Collaboration Information
System(ICIS) Framework
p
Search local repository using database access
Integrated Collaboration Information
System(ICIS) Framework II
p
Search local repository using memory with
Message rate scalability result
Contribution
p
System Research
n
Providing a architecture and model for
integration of collaborative systems
n
Integration and interoperability of annotation,
search tools, and web search tools
n
User collaboration and sharing resources.
n
Providing benchmarks to evaluate the
Contribution II
p
System Research
n Increasing performance and scalability using memory
utilization
n Providing flexibility allowing integration of different tools
having common metadata.
n Easy to add and extend service mechanism
n Supporting authorization and event based mechanism n Implementing a rather more powerful access control
mechanism
p
System Software
Future Works
p
Apply Integrated Collaboration Information
System(ICIS) Framework to other application
domains such as streaming collaboration systems
p
Integrate other collaboration and search tools
into ICIS Framework
n
CiteSeer
Xp
Use distributed storages instead of a single
storage
p