Title | ||
---|---|---|
S3QL: A distributed domain specific language for controlled semantic integration of life sciences data. |
Abstract | ||
---|---|---|
Background The value and usefulness of data increases when it is explicitly interlinked with related data. This is the core principle
of Linked Data. For life sciences researchers, harnessing the power of Linked Data to improve biological discovery is still
challenged by a need to keep pace with rapidly evolving domains and requirements for collaboration and control as well as
with the reference semantic web ontologies and standards. Knowledge organization systems (KOSs) can provide an abstraction
for publishing biological discoveries as Linked Data without complicating transactions with contextual minutia such as provenance
and access control.
We have previously described the Simple Sloppy Semantic Database (S3DB) as an efficient model for creating knowledge organization
systems using Linked Data best practices with explicit distinction between domain and instantiation and support for a permission
control mechanism that automatically migrates between the two. In this report we present a domain specific language, the S3DB
query language (S3QL), to operate on its underlying core model and facilitate management of Linked Data.
Results Reflecting the data driven nature of our approach, S3QL has been implemented as an application programming interface for S3DB
systems hosting biomedical data, and its syntax was subsequently generalized beyond the S3DB core model. This achievement
is illustrated with the assembly of an S3QL query to manage entities from the Simple Knowledge Organization System. The illustrative
use cases include gastrointestinal clinical trials, genomic characterization of cancer by The Cancer Genome Atlas (TCGA) and
molecular epidemiology of infectious diseases.
Conclusions S3QL was found to provide a convenient mechanism to represent context for interoperation between public and private datasets
hosted at biomedical research institutions and linked data formalisms. |
Year | DOI | Venue |
---|---|---|
2011 | 10.1186/1471-2105-12-285 | BMC Bioinformatics |
Keywords | Field | DocType |
relational data,application program interface,use case,infectious disease,biology,semantics,rdf,access control,molecular epidemiology,internet,clinical trial,best practice,query language,semantic web,domain specific language,semantic integration,programming languages,simple knowledge organization system,database management systems,sparql,linked data | Ontology (information science),Semantic integration,Computer science,Semantic Web,Linked data,SPARQL,Simple Sloppy Semantic Database,Simple Knowledge Organization System,Bioinformatics,Knowledge organization | Journal |
Volume | Issue | ISSN |
12 | 1 | 1471-2105 |
Citations | PageRank | References |
14 | 0.45 | 20 |
Authors | ||
8 |
Name | Order | Citations | PageRank |
---|---|---|---|
Helena F. Deus | 1 | 210 | 13.23 |
Miria C. Correa | 2 | 14 | 0.45 |
Romesh Stanislaus | 3 | 88 | 3.56 |
Maria Miragaia | 4 | 14 | 0.45 |
Wolfgang Maass | 5 | 3717 | 391.51 |
Hermínia de Lencastre | 6 | 21 | 1.03 |
Ronan Fox | 7 | 59 | 4.98 |
Jonas S Almeida | 8 | 731 | 42.25 |