An Approach for Schema Extraction of NoSQL Graph Databases
Abstrak
Currently, a large volume of heterogeneous data is generated and consumed by several classes of applications, which raise a new family of database models called NoSQL. NoSQL graph databases is a member of this family. They provide high scalability and are schemaless, i.e., they do not require an implicit schema such as relational databases. However, the knowledge of how data is structured may be of great importance for data integration or data analysis processes. There are some works in the literature that extract the schema from graph structures or graph-based data sources. Different from them, this work proposes a comprehensive approach that consider all the common NoSQL database graph data model concepts, and generates a schema in the recent JSON Schema recommendation. Experimental evaluations show that our solution generates a suitable schema representation with a linear complexity.
Artikel Ilmiah Terkait
Veronica Santos Bruno Cuconato
24 Desember 2024
Graphs are the most suitable structures for modeling objects and interactions in applications where component inter-connectivity is a key feature. There has been increased interest in graphs to represent domains such as social networks, web site link structures, and biology. Graph stores recently rose to prominence along the NoSQL movement. In this work we will focus on NOSQL graph databases, describing their peculiarities that sets them apart from other data storage and management solutions, and how they differ among themselves. We will also analyze in-depth two different graph database management systems - AllegroGraph and Neo4j that uses the most popular graph models used by NoSQL stores in practice: the resource description framework (RDF) and the labeled property graph (LPG), respectively.
Roshan Chitrakar Hira Lal Bhandari
2024
With the proliferation of data, relational database management systems face the challenge of managing big volumes of data because of the absence of dynamic data models, high functioning, high scalability, high availability, and low cost. The NoSQL technology has the features which RDBMS lacks. Therefore, many organizations are migrating from RDBMS to NoSQL databases. The migration from RDBMS to NoSQL is not always effective if the data is lost, if the data is replicated while transmission, and if the performance is degraded. For this reason, enhancement of transformation algorithm proposed by Shabana Ramzan et al. has been performed to implement the methodology which is used to successfully migrate data from structured and normalized SQL database MySQL to NoSQL graph database Neo4j. SchemaCrawler engine has been used to metamodeling from source to destination database and execute the transformation rules. This study has also evaluated and validated the performance of the enhanced algorithm after successful migration of data. The NoSQL graph database Neo4j handles the most common form of data that are unstructured, semi-structured, and structured as well as they support the ACID property of relational databases. This feature is not supported by other NoSQL databases such as Oracle NoSQL, MongoDB, and HBase databases.
Robert Pavliš
20 Mei 2024
As the global volume of data continues to rise at an unprecedented rate, the challenges of storing and analyzing data are becoming more and more highlighted. This is especially apparent when the data are heavily interconnected. The traditional methods of storing and analyzing data such as relational databases often encounter difficulties when dealing with large amounts of data and this is even more pronounced when the data exhibits intricate interconnections. This paper examines graph databases as an alternative to relational databases in an interconnected Big Data environment. It will also show the theoretical basis behind graph databases and how they outperform relational databases in such an environment, but also why they are better suited for this kind of environment than other NoSQL alternatives. A state of the art in graph databases and how they compare to relational databases in various scenarios will also be presented in this paper.
R. Mello Telmo Henrique Valverde da Silva
4 Oktober 2021
Several application domains hold highly connected data, like supply chain and social network. In this context, NoSQL graph databases raise as a promising solution since relationships are first class citizens in their data model. Nevertheless, a traditional database design methodology initially defines a conceptual schema of the domain data, and the Enhanced Entity-Relationship (EER) model is a common tool. This paper presents a rule-based conversion process from an EER schema to Neo4j schema constraints, as Neo4j is the most representative NoSQL graph database management system with an expressive data model. Different from related work, our conversion process deals with all EER model concepts and generates rules for ensuring schema constraints through a set of Cypher instructions ready to run into a Neo4j database instance, as Neo4J is a schemaless system, and it is not possible to create a schema a priori. We also present an experimental evaluation that demonstrates the viability of our process in terms of performance.
Gianina Mihai
1 Desember 2020
Relational databases have not been the only solution for the applications that use them for years. New applications must deal with large volumes of complex, distributed data of different types, whose volume increases very rapidly. In these applications, relational databases are supplemented with several new generic data models referred to as NoSQL. In this article we shall present the main types and characteristics of NoSQL data models and we shall make, both a detailed and a synthetic, comparison between the relational model and the NoSQL data models.
Daftar Referensi
0 referensiTidak ada referensi ditemukan.
Artikel yang Mensitasi
0 sitasiTidak ada artikel yang mensitasi.