Software:Ontotext GraphDB

From HandWiki
Short description: RDF-store
GraphDB
Ontotext GraphDB Logo.png
Developer(s)Ontotext
Stable release
9.9.1 / September 2021 (2021-09) [1]
Operating systemCross-platform
Available inEnglish
TypeDatabase, Triplestore, Graph databases
LicenseGraphDB-Free is free to use. SE and EE are licensed per CPU-Core used. Perpetual and annual subscription models are available.
Websitewww.ontotext.com/products/graphdb

Ontotext GraphDB (previously known as BigOWLIM) is a graph database[2][3] and knowledge discovery[4][5][6] tool compliant with RDF[7] and SPARQL[8] and available as a high-availability cluster. Ontotext GraphDB is used in various European research projects.[9]

As of April 2021, Graph DB is ranked as the 4th most -popular[10] RDF store[11][12] and 6th most-popular Graph DBMS system.[13] Some categorize it as a NoSQL database.[14] In 2014 Ontotext acquired the trademark "GraphDB" from Sones.

As for a typical graph DB, ontologies are an important input for the databases.[15] The underlying idea is a semantic repository.[16]

Architecture

GraphDB is used to store and manage semantic Knowledge Graph data. It is built on top of the RDF4J architecture implemented through RDF4J's Storage and Inference Layer (SAIL). The architecture is made of three main components:

  • The Workbench is a web-based administration tool. The user interface is based on RDF4J Workbench Web Application
  • The Engine consists of a query optimizer, reasoner,[17] storage and plugin manager. The reasoner in GraphDB is Forward chaining with the goal of total materialization.[18] The plugin manager supports user-defined indexes and can be configured dynamically during run-time. These include:
    • RDF Rank, which is an algorithm that identifies the most relevant entities, similar to Google's PageRank by evaluating their interconnectedness
    • GeoSPARQL, which is the standard for geographical linked data. The plugin is able to convert between coordinate reference systems into the default, which OGC specifies as CRS84 format
    • Lucene, which supports full-text search capabilities. This provides a variety of indexing options and the ability to simultaneously use multiple, differently configured indexes in the same query using Apache Lucene, a high-performance, full-featured text search engine
  • The Connectors: The performance of search such as full-text search and faceted search can be vastly improved via the connectors by enabling the implementation by an external component or service. GraphDB has a connector for both well-known open-source search engines, Solr and Elasticsearch.
    • There is also a connector enabling MongoDB integration, providing the scalability and performance advantages.
    • Relational data virtualization (Ontology-Based Data Access, OBDA) is provided by integration of ontop
    • SQL Access over JDBC is provided[19] for traditional analytics tools such as Tableau and PowerBI
    • Kafka Sink Connector[20] for ingesting large amounts of data.
    • GraphQL access to knowledge graphs[21] and semantic search[22] based on Elasticsearch and exposed through GraphQL.

Features and Integrations

According to Ontotext, Graph DB supports:

  • GraphDB uses RDF4J as a library, utilizing its APIs for storage and querying.
  • It supports the GraphQL, SPARQL and SeRQL languages and RDF (e.g., RDF/XML, N3, Turtle) serialization formats.
  • It supports custom reasoning rulesets, as well as RDFS, RDFS-plus, OWL 2 RL and QL.[23]
  • It integrates OpenRefine for the ingestion of tabular data[24] and provides semantic similarity search at the document level.[25]

Uses

Ontotext Graph DB is used in various scientific areas, e.g., Genetics,[26] Healthcare,[27] Data Forensics,[28] Cultural Heritage,[29] Geography,[30] Infrastructure Planning,[31] Civil Engineering,[32] Digital Historiography,[33] Oceanography.[34]

For more examples see "Diverse Uses of a Semantic Graph Database for Knowledge Organization and Research" below.

Commercial clients include BBC Sport,[35][36] Financial Times,[37] Springer Nature,[38] UK Parliament,[39][40] AstraZeneca[41] as well as in the pharmaceutical and finance industries.

Some use cases focus on scalability and large data sizes.[42]

See also

External links

References

  1. "Graph Databases (Technology)" (in en-GB). https://graphdb.ontotext.com/documentation/standard/release-notes.html. 
  2. "Graph Databases (Technology)" (in en-GB). https://www.bloorresearch.com/technology/graph-databases/. 
  3. "Global Graph Database Market by Type, Application, Component, Deployment Type, Industry Vertical & Region - Analysis & Forecast to 2023 - ResearchAndMarkets.com" (in en). 2018-06-28. https://www.businesswire.com/news/home/20180628005581/en/Global-Graph-Database-Market-by-Type-Application-Component-Deployment-Type-Industry-Vertical-Region---Analysis-Forecast-to-2023---ResearchAndMarkets.com. 
  4. "KMWorld AI 50: The Companies Empowering Intelligent Knowledge Management". https://www.kmworld.com/Articles/Editorial/Features/KMWorld-AI-50-The-Companies-Empowering-Intelligent-Knowledge-Management-141554.aspx. 
  5. "Global Semantic Knowledge Discovery Software Market Growth (Status and Outlook) 2019-2024 - Market Research Insights". https://www.mrinsights.biz/report/global-semantic-knowledge-discovery-software-market-growth-status-192862.html. 
  6. Buchmann, Robert (2019). "Model-Aware Software EngineeringA Knowledge-based Approach to Model-Driven Software Engineering". https://www.scitepress.org/papers/2018/66941/66941.pdf. 
  7. Motik, Boris; Nenov, Yavor; Piro, Robert; Horrocks, Ian; Olteanu, Dan (2014-06-19). "Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF Systems" (in en). Proceedings of the AAAI Conference on Artificial Intelligence 28 (1). doi:10.1609/aaai.v28i1.8730. ISSN 2374-3468. https://ojs.aaai.org/index.php/AAAI/article/view/8730. 
  8. "SparqlImplementations - W3C Wiki". https://www.w3.org/wiki/SparqlImplementations. 
  9. "Google Scholar". https://scholar.google.com/scholar?hl=en&as_sdt=0,5&q=graphdb+ontotext&oq=graphDB. 
  10. "DB-Engines Ranking" (in en). https://db-engines.com/en/ranking/rdf+store. 
  11. Guest, CIO Central. "The Hype Around Graph Databases And Why It Matters" (in en). https://www.forbes.com/sites/ciocentral/2015/04/06/the-hype-around-graph-databases-and-why-it-matters/. 
  12. ltd, Research and Markets. "Graph Database Market by Type (RDF and Property Graph), Application (Recommendation Engines, Fraud Detection, Risk and Compliance Management), Component (Tools and Services), Deployment Mode, Industry Vertical, and Region - Global Forecast to 2024" (in english). https://www.researchandmarkets.com/reports/4841770/graph-database-market-by-type-rdf-and-property. 
  13. "GraphDB System Properties". https://db-engines.com/en/system/GraphDB. 
  14. "GraphDB" (in en-GB). https://www.capterra.co.uk/software/157533/graph-db. 
  15. Ledvinka, Martin (2015). "JOPA: Accessing Ontologies in an Object-oriented Way". https://www.scitepress.org/papers/2015/54003/54003.pdf. 
  16. Kiryakov, Atanas (November 2005). "OWLIM—a pragmatic semantic repository for OWL". https://www.researchgate.net/publication/221194880. 
  17. Stoilos, Giorgos; Grau, Bernardo Cuenca; Horrocks, Ian (2010-07-05). "How Incomplete is Your Semantic Web Reasoner?" (in en). Proceedings of the AAAI Conference on Artificial Intelligence 24 (1): 1431–1436. doi:10.1609/aaai.v24i1.7498. ISSN 2374-3468. https://ojs.aaai.org/index.php/AAAI/article/view/7498. 
  18. Kiryakov, Atanas; Ognyanov, Damyan; Manov, Dimitar (2005). "OWLIM – A Pragmatic Semantic Repository for OWL". in Dean, Mike; Guo, Yuanbo; Jun, Woochun et al. (in en). Web Information Systems Engineering – WISE 2005 Workshops. Lecture Notes in Computer Science. 3807. Berlin, Heidelberg: Springer. pp. 182–192. doi:10.1007/11581116_19. ISBN 978-3-540-32287-0. 
  19. "SQL Access over JDBC". Ontotext. https://graphdb.ontotext.com/documentation/10.0/sql-access-over-jdbc.html. 
  20. "Kafka Sink Connector¶". Ontotext. https://graphdb.ontotext.com/documentation/10.0/kafka-sink-connector.html. 
  21. "Semantic Objects: Overview". Ontotext. https://platform.ontotext.com/semantic-objects/. 
  22. "Semantic Search: Overview". Ontotext. https://platform.ontotext.com/semantic-search/. 
  23. "GraphDB Reasoning: predefined rulesets". Ontotext. https://graphdb.ontotext.com/documentation/10.0/reasoning.html#predefined-rulesets. 
  24. "Ontotext Refine: Overview and features". Ontotext. https://platform.ontotext.com/ontorefine/. 
  25. "Semantic similarity searches". Ontotext. https://graphdb.ontotext.com/documentation/10.0/semantic-similarity-searches.html. 
  26. Poncheewin, Wasin; Hermes, Gerben D. A.; van Dam, Jesse C. J.; Koehorst, Jasper J.; Smidt, Hauke; Schaap, Peter J. (2020). "NG-Tax 2.0: A Semantic Framework for High-Throughput Amplicon Analysis" (in English). Frontiers in Genetics 10: 1366. doi:10.3389/fgene.2019.01366. ISSN 1664-8021. PMID 32117417. 
  27. Barisevičius, Gintaras; Coste, Martin; Geleta, David; Juric, Damir; Khodadadi, Mohammad; Stoilos, Giorgos; Zaihrayeu, Ilya (2018). "Supporting Digital Healthcare Services Using Semantic Web Technologies". in Vrandečić, Denny; Bontcheva, Kalina; Suárez-Figueroa, Mari Carmen et al. (in en). The Semantic Web – ISWC 2018. Lecture Notes in Computer Science. 11137. Cham: Springer International Publishing. pp. 291–306. doi:10.1007/978-3-030-00668-6_18. ISBN 978-3-030-00668-6. https://link.springer.com/chapter/10.1007/978-3-030-00668-6_18. 
  28. Zhuhadar, Leyla; Ciampa, Mark (2019-03-01). "Leveraging learning innovations in cognitive computing with massive data sets: Using the offshore Panama papers leak to discover patterns" (in en). Computers in Human Behavior 92: 507–518. doi:10.1016/j.chb.2017.12.013. ISSN 0747-5632. https://www.sciencedirect.com/science/article/abs/pii/S0747563217306933. 
  29. Damiano, Rossana; Lombardo, Vincenzo; Lieto, Antonio; Borra, Davide (2016-07-01). "Exploring cultural heritage repositories with creative intelligence. The Labyrinth 3D system" (in en). Entertainment Computing 16: 41–52. doi:10.1016/j.entcom.2016.05.002. ISSN 1875-9521. https://www.sciencedirect.com/science/article/abs/pii/S1875952116300167. 
  30. Panasiuk, Oleksandra (2019). "Representing GeoData for Tourism with Schema.org". https://schema-tourism.sti2.org/sites/default/files/representing_geodata.pdf. 
  31. Azzam, Amr; Aryan, Peb Ruswono; Cecconi, Alessio; Di Ciccio, Claudio; Ekaputra, Fajar J.; Fernandez Garcia, Javier David; Karampatakis, Sotiris; Kiesling, Elmar et al. (2019), Antonella Longo, Maria Fazio, ed. (in en), The CitySPIN Platform: A CPSS Environment for City-Wide Infrastructures, Bilbao, Spain: CEUR Workshop Proceedings, pp. 57–64, http://ceur-ws.org/Vol-2530/paper8.pdf, retrieved 2021-04-15 
  32. Nundloll, Vatsala; Lamb, Rob; Hankin, Barry; Blair, Gordon (2021-04-01). "A semantic approach to enable data integration for the domain of flood risk management" (in en). Environmental Challenges 3: 100064. doi:10.1016/j.envc.2021.100064. ISSN 2667-0100. Bibcode2021EnvCh...300064N. 
  33. Quaresma, Paulo (2020). "Information Extraction from Historical Texts:a Case Study". http://ceur-ws.org/Vol-2607/short2.pdf. 
  34. Zárate, Marcos; Rosales, Pablo; Braun, Germán; Lewis, Mirtha; Fillottrani, Pablo Rubén; Delrieux, Claudio (2019). "OceanGraph: Some Initial Steps Toward a Oceanographic Knowledge Graph". in Villazón-Terrazas, Boris; Hidalgo-Delgado, Yusniel (in en). Knowledge Graphs and Semantic Web. Communications in Computer and Information Science. 1029. Cham: Springer International Publishing. pp. 33–40. doi:10.1007/978-3-030-21395-4_3. ISBN 978-3-030-21395-4. https://link.springer.com/chapter/10.1007/978-3-030-21395-4_3. 
  35. "BBC - BBC Internet Blog: Sports Refresh: Dynamic Semantic Publishing". https://www.bbc.co.uk/blogs/bbcinternet/2012/04/sports_dynamic_semantic.html. 
  36. "BBC - BBC Internet Blog: BBC World Cup 2010 dynamic semantic publishing". https://www.bbc.co.uk/blogs/bbcinternet/2010/07/bbc_world_cup_2010_dynamic_sem.html. 
  37. "Semantic Technology for online, broadcast and print media" (in en). http://videolectures.net/wims2014_rayfield_semantic_technology/. 
  38. "SciGraph | For Researchers". https://www.springernature.com/gp/researchers/scigraph. 
  39. "Linked Government Data". https://www.nationalarchives.gov.uk/documents/information-management/open-and-linked-data-johnsheridan.ppt. 
  40. "Performance testing a graph database | Parliamentary Digital Service" (in en). https://pds.blog.parliament.uk/2017/12/15/performance-testing-a-graph-database/. 
  41. Anadiotis, George. "Graph databases and RDF: It's a family affair" (in en). https://www.zdnet.com/article/graph-databases-and-rdf-its-a-family-affair/. 
  42. Bishop, Barry (January 2011). "OWLIM: A family of scalable semantic repositories". https://www.researchgate.net/publication/220575516. 
  43. Alexiev, Vladimir (March 2021). "Diverse Uses of a Semantic Graph Database for Knowledge Organization and Research". European Data Conference on Reference Data and Semantics (ENDORSE 2021). https://op.europa.eu/documents/7525478/8087182/ALEXIEV_presentation_Diverse+Uses+of+a+Semantic+Graph+Database+for+Knowledge+Organization+and+Research.pdf. 
  44. Alexiev, Vladimir. "Diverse Uses of Ontotext GraphDB". https://www.youtube.com/watch?v=0q63x2P1V0o&list=PLT5rARDev_rmGr_LJkr7zcI-Qul7yOOHO&index=4&t=4780s. 
  45. Alexiev, Vladimir. "Diverse Uses of Ontotext GraphDB". https://github.com/VladimirAlexiev/ontotext-graphdb-applications. 
  46. "Ontotext-GraphDB". https://www.zotero.org/groups/2744757/ontotext-graphdb.