Tech Accelerator What is a cloud database? An in-depth cloud DBMS guide

Definition

graph database

TechTarget Contributor

By

TechTarget Contributor

A graph database, also referred to as a semantic database, is a software application designed to store, query and modify network graphs. A network graph is a visual construct that consists of nodes and edges. Each node represents an entity (such as a person) and each edge represents a connection or relationship between two nodes.

Graph databases have been around in some variation for along time. For example, a family tree is a very simple graph database.

The concept of using databases to map relationships digitally started seeing popular usage in business around 2015 when increased compute power, in-memory computing, and agreed-upon standards moved the concept from academics to real-world uses in business and enterprise computing.

Graph databases are well-suited for analyzing interconnections, which is why there has been a lot of interest in using graph databases to mine data from social media. Graph databases are also useful for working with data in business disciplines that involve complex relationships and dynamic schema, such as supply chain management, identifying the source of an IP telephony issue and creating "customers who bought this also looked at..." recommendation engines.

The concept behind graphing a database is often credited to 18th-century mathematician Leonhard Euler.

The structure of a graph database

Traditionally classified as a type of NoSQL database, graph databases are sometimes referred to as triple stores. That's because this type of database uses a special index that stores information about nodes, edges and the relationship between them in groups of three.

This article is part of

What is a cloud database? An in-depth cloud DBMS guide

A triple, which may also be referred to as an assertion, has three main fields: a subject, a predicate and an object. Each subject, predicate or object is represented by a unique resource identifier (URI).

How information is indexed

In a triple store, the first field in the database holds the URI for the subject, the second field holds the URI for the predicate and the third field holds a URI for the object. While there are a number of different strategies that graph databases may use for storing triples, most use an index that abbreviates the three primary fields to {?s, ?p, ?o}.

For example, if the visual construct for a graph is given as follows:

Nodes and edges

Then the index will look like this:

Row	?s	?p	?o
1	:Bob	:marriedTo	:Julie
2	:Bob	:brotherOf	:Steve
3	:Bob	:listensTo	:RockMusic
4	:Julie	:listensTo	:RockMusic
5	:Julie	:sisterInLawTo	:Steve
6	:Jim	:worksFor	:IBM

How information in a graph database is queried

Each triple in a graph database only gets stored once in the index. Just like relational databases, it's a simple process to do a straight lookup query in a graph database.

If the query is for what information is known about Bob, the indexer programming only needs to search rows 1-3 of the database.

The real power and speed of a graph database comes from indexing combinations of triples. Here's are a few examples:

If the query is for who Bob is married to, the indexer will look for the predicate :marriedTo in rows 1-3 and then retrieve the matching object. (Bob is married to Julie.)
If the query is to identify everyone who listens to the same kind of music as Bob, the indexer will first ask { :Bob :listensTo ?o } and identify :RockMusic as the object.

In the second query, the results will return :RockMusic in rows 3 and 4. The subject in row 3 is Bob himself, so whoever is the subject in row 4 will be the other person who listens to rock music. (It turns out to be Julie, Bob's wife.)

Types of graph databases

Historically, graph databases have been divided into two categories -- property graphs that simply support nodes and edges, and knowledge graphs like the one above that can focus on the semantic aspects of data and store information in triples. Generally speaking, indexing strategies for both types are similar.

It is expected that over time, knowledge graphs and property graphs will merge and the architectural distinctions between these two types of graph databases will fade away.

Use cases for graph databases

Current use cases for graph databases include the following:

Allow data analysts to federate data sets without having to create and run complex queries that join combinations of tables together, as in the relational database model.
Help developers create the back end for voice assistants by mapping possible user questions to correct answers.
Identify clusters of events that are connected in unusual ways to detect fraud.
Examine direct connections to identify potential indirect connections for recommendation engines.

Future of graph databases

Graphs databases are expected to play a major role in areas as diverse as machine learning, Bayesian analysis, data science and artificial intelligence, as well as helping to manage enterprise data and data interchange, over the next decade.

One of the most significant impacts on this type of database will be improvements in data federation. When knowledge graphs can be easily federated, one database will be able to determine that it needs data it doesn't have and automatically retrieve that data from other knowledge graph. With this ability, it is likely that federation will help developers create blockchains that use relevant metadata to authenticate transactions in banking, finance, voting and smart contracts.

This was last updated in March 2020

Continue Reading About graph database

LinkedIn manages a graph database that digitally maps the global economy to connect talent with opportunity at massive scale.

Graph analytics platform becomes compliance and fraud detection tool

Why Facebook and the NSA love graph databases

Step aside Facebook, graph database use cases go beyond social media

Graph database vs. relational database

network scanning
Network scanning is a procedure for identifying active devices on a network by employing a feature or features in the network ...
networking (computer)
Networking, also known as computer networking, is the practice of transporting and exchanging data between nodes over a shared ...
What is SD-WAN (software-defined WAN)? Ultimate guide
Software-defined WAN is a technology that uses software-defined networking concepts to distribute network traffic across a wide ...

identity management (ID management)
Identity management (ID management) is the organizational process for ensuring individuals have the appropriate access to ...
fraud detection
Fraud detection is a set of activities undertaken to prevent money or property from being obtained through false pretenses.
single sign-on (SSO)
Single sign-on (SSO) is a session and user authentication service that permits a user to use one set of login credentials -- for ...

CIO

IT budget
IT budget is the amount of money spent on an organization's information technology systems and services. It includes compensation...
project scope
Project scope is the part of project planning that involves determining and documenting a list of specific project goals, ...
core competencies
For any organization, its core competencies refer to the capabilities, knowledge, skills and resources that constitute its '...

recruitment
Recruitment is the process of finding, screening, hiring and onboarding qualified job candidates.
Workday
Workday is a cloud-based software vendor that specializes in human capital management (HCM) and financial management applications.
recruitment management system (RMS)
A recruitment management system (RMS) is a set of tools designed to manage the employee recruiting and hiring process. It might ...

Customer Experience

martech (marketing technology)
Martech (marketing technology) refers to the integration of software tools, platforms, and applications designed to streamline ...
transactional marketing
Transactional marketing is a business strategy that focuses on single, point-of-sale transactions.
customer profiling
Customer profiling is the detailed and systematic process of constructing a clear portrait of a company's ideal customer by ...

Close