• No results found

6.3 Query Analysis

6.3.1 Analysis on OpenResearch.org

On top of the basic architectural layers, OpenResearch offers services for different stakeholders of scientific communication. As a semantic wiki, it offers initial LOD services and semantic representation of metadata about events. We address the issues discussed in section 6.1.1 by establishing a set of quality metrics for scientific events and implementing them as properties. We adopt the definition of quality as fitness for use, which, here, means the extent to which the specification of an event satisfies

6.3 Query Analysis

its stakeholders [127, 144]. In the remainder of this section, the current services are explained in three categories: wiki pages, LOD services and queries.

Semantic Wiki Pages SMW powers OpenResearch to provide semantic representation of CfPs as one wiki page per event. In OpenResearch, specific semantic forms have been designed for each type of entities to make content creation and revision as easy as possible for users.

{{ Event ser ies | A c r o n y m = ESWC | Title = E x t e n d e d S e m a n t i c Web C o n f e r e n c e | has T w i t t e r = @ e s w c _ c o n f | has C O R E 2 0 1 4 Rank = A | has C O R E 2 0 1 7 Rank = A | has C O R E 2 0 1 8 Rank = A | Field = S e m a n t i c Web | H o m e p a g e = eswc - c o n f e r e n c e s . org

| has B i b l i o g r a p h y = dblp . uni - trier . de / db / conf / esws /}} [[ C a t e g o r y : C o n f e r e n c e se ries ]]

Listing 6.1: Metadata Representation on OR. Metadata of the ESWC conference series.

Properties of each semantic object are populated via fields in these semantic forms. The following example shows the generated SMW wiki markup containing general information about an event. Further information about committee members, extensions and other important dates can also be provided in other parts of the form. The complete textual representation of the CfPs can also be added as content of the wiki page with embedded semantic annotations.

All data created within OpenResearch is published as Linked Open Data (LOD). In the sequel, we describe ways for accessing OpenResearch LOD. Afterwards, we outlines how the LOD approach enables building further services on top by sketching two possible ways of consuming the OpenResearch LOD: interlinking with relevant datasets, and using OpenResearch LOD as external plug-in for the Fidus Writer scientific authoring platform16.

Implementation of the defined metrics and dimensions has been done with an on-demand decision making process. Some of the metrics suited to be defined as a raw property. The derived metrics have been computed by queries over the data (using MediaWiki expressions) for example acceptance rate := accepted/submitted; average acceptance rate over series):

AcceptanceRate=No. accepted papers No. submissions

The implementation of this composite that can be calculated from the raw properties has been done in the template of the corresponding entity(here event):

{{# ifeq :{

{{ S u b m i t t e d pap ers | } } | | | { { T a b l e r o w | Label = Pa pers :

| Value = S u b m i t t e d ([[ S u b m i t t e d pa pers ::{{{ S u b m i t t e d p apers } }}]]) / ([[ A c c e p t e d [[ A c c e p t e d paper s :={{{ A c c e p t e d pape rs }}}] ]) ([[ A c c e p t a n c e rate ::{{# expr :{{{ A c c e p t e d pape rs }}}/

{{{ S u b m i t t e d papers }}} * 100 round 1} }]]) \% }}}}

Accessing OpenResearch LOD An updated version of the OpenResearch dataset is produced daily and available for download and query.17. The data is also queryable via a SPARQL endpoint18. In addition, the semantic representation of the metadata for each event is represented as an RDF feed in each page. The RDF feed for the EKAW 2016 resource is available at http://openresearch.org/ Special:ExportRDF/EKAW_2016. To expose dereferenceable resources conforming with Linked Data best practices, the URI resolver provides URIs with content negotiation; e.g., for the EKAW 2016 resource the URI is http://openresearch.org/Special:URIResolver/EKAW_2016.

To support the creation of various views, recommendations and ranked lists (by quality indicators), queries can be defined and executed using all defined properties and classes and the results can be embedded in wiki pages. For example, events can be ranked by acceptance rate using the corresponding properties in queries:

{{# ask :[[ C a t e g o r y : Event ]] | ? title = Name

| ? Event in se ries = Se ries | ? C a t e g o r y | ? A c c e p t a n c e rate | for mat = table

| limit =10

| sort = A c c e p t a n c e rate | order = desc

}}

Listing 6.2: ASK Query on OR. Top 10 event series sorted with their acceptance rate.

It is also possible to capture the relationships between various types of entities (e.g. event series, sub/super events, roles of a person in event organization, etc.). Many popular views have been implemented in OpenResearch as pre-defined queries. Various display formats provided by SMW extensions are used to visualize the query results. Figure 6.10 shows a map view of the upcoming events using location-based filtering. Similarly, calendar and timeline views show upcoming submission and notification deadlines as well as the events themselves.

In addition, taking, for example, participation figures into account enables new indicators for meas- uring the quality and relevance of research that are not just based on citation counts [121]. Based on semantically enriched indicators, predefined SPARQL queries as well as form-based search facilities will be implemented for recommendation services.

Integration with an Authoring Platforms In this section we introduce our approach to improve the workflow of authoring processing [177]. The OpenResearch LOD will be plugged into the Fidus Writer authoring platform to improve the workflow in the following use cases:

1. Venue recommendation: One of the critical aspects in the process of writing and publishing is to find a suitable event to submit the scientific results. The OpenResearch dataset contains data about events annotated with corresponding scientific field as :category and keywords. We also annotate keywords from the content of the under-production scholarly document in the OSCOSS project that could be imported to the OpenResearch search services.For example, Find all events in the computer science field that focus on data analysis, big data, knowledge engineering, linked data. The result of queries can be shown to the authors with a user-friendly interface and filtering metrics such as deadline and location distance.

17https://zenodo.org/record/57899 18http://openresearch.org/sparql

6.3 Query Analysis

Figure 6.10: Geographic Data with Dynamic. Location-related distribution of upcoming events on a map view.

2. Direct link to submission pages: The OpenResearch data contains a property named submission linkthat provides a direct link to paper submission pages of events. The submission page of the targeted event can be made accessible easily from the authoring platform.

3. Notification services: there are different deadlines attached to the events that should be considered by authors such as abstract deadline, submission deadline or registration deadline as well as deadline extensions. Enabling notification services in the authoring platform will support both organizers and researchers.

The OR knowledge graph is built upon a combination of data captured by SAANSET and crowd sourcing, which is utilized to define and execute complex queries. Many convenient views, e.g., calendar view, map view, time line, have been implemented in OR as pre-defined queries. Various display formats provided by SMW extensions are also used to visualize the results of query. The following query utilizes the information captured by SAANSET (title, date, city, country, field, homepage) to answer a question that is otherwise cumbersome or impossible to answer for researchers.

Sample SPARQL queries are defined in the example page of OR19. The following visualization of metadata related to scholarly events have been implemented using MediWiki extensions. Many popular views have been implemented in OpenResearch as pre-defined queries. Various display formats provided by SMW extensions are used to visualize the query results. Figure 6.10 shows a map view of the upcoming events using location-based filtering. Users are enables to pin the locations and add them to their personal pages. Filtering can easily work in different granularity e.g., city, country, and continent.

Similarly, calendar and time line Figure 6.11 views show upcoming submission and notification deadlines as well as the events themselves. Every individual person or group can crate a list of events as agenda for a year. Listing 6.3 shows the corresponding query for these visualizations.

Figure 6.11: Timeline View. Time line of upcoming events.

SELE CT ? event ? e n d D a t e ? s t a r t D a t e ? city ? c o u n t r y ? h o m e p a g e

WHERE { ? e p r o p e r t y : H a s _ l o c a t i o n _ c o u n t r y c a t e g o r y : G e r m a n y . ? e rdfs : label ? event . ? e p r o p e r t y : H a s _ l o c a t i o n _ c i t y ? city . ? e p r o p e r t y : H a s _ l o c a t i o n _ c o u n t r y ? c o u n t r y . ? c o u n t r y rdfs : s u b C l a s s O f ? p a r t C o n t i n e n t . ? p a r t C o n t i n e n t rdfs : s u b C l a s s O f ? c o n t i n e n t . ? c o n t i n e n t rdfs : i s D e f i n e d B y site : C a t e g o r y : E urope . ? e a c a t e g o r y : S e m a n t i c _ W e b . ? e i c a l t z d : dtend ? e n d D a t e . ? e i c a l t z d : d t s t a r t ? s t a r t D a t e . ? e foaf : h o m e p a g e ? h o m e p a g e .

FILT ER (? s t a r t D a t e >= " 2018 -01 -01 " ^^ xsd : date && ? e n d D a t e < " 2019 -01 -01 " ^^ xsd : date ) . } B I N D I N G S ? E v e n t T y p e s {( s mwont : C o n f e r e n c e E v e n t )

( smw ont : W o r k s h o p E v e n t ) }

Listing 6.3: SPARQL Query on OR. Upcoming events wrt. specific criteria