PuppyGraph is the first and only real time, zero-ETL graph query engine in the market, empowering data teams to query existing relational data stores as a unified graph model that deployed in under 10 minutes, bypassing traditional graph databases' cost, latency, and maintenance hurdles. Capable of scaling with petabytes of data and executing complex 10-hop queries in seconds, PuppyGraph supports use cases from enhancing LLMs with knowledge graphs to fraud detection, cybersecurity and more. Trusted by industry leaders, including Coinbase, AMD, Netskope, Palo Alto Network, eBay, and more.

How does PuppyGraph compare to Neo4j?

Unlike Neo4j, which requires you to load and sync data into its proprietary graph store, PuppyGraph runs directly on your data sources—eliminating ETL, reducing TCO, and enabling faster time-to-value. PuppyGraph also integrates natively with Databricks Unity Catalog, Google BigQuery, and AlloyDB.

What are the performance benefits of PuppyGraph?

PuppyGraph delivers multi-hop traversals in seconds over billions of edges. Real customer stories cite 5-hop queries on 1B+ edges in under 3 seconds.

Does PuppyGraph support my cloud data stack?

Yes. PuppyGraph natively integrates with Databricks Unity Catalog, Google BigQuery, AlloyDB, and AWS, keeping a single governed copy of your data.

How does PuppyGraph handle data governance and security?

PuppyGraph leverages your existing catalog and security (Unity Catalog, BigQuery, AlloyDB), so all graph queries respect your current access controls.

Can PuppyGraph power AI and LLM applications (GraphRAG)?

Yes. PuppyGraph enables Graph-based Retrieval Augmented Generation (GraphRAG) directly on your governed data—providing explainable, multi-hop context for LLMs and enterprise AI.

See all case studies

Knowledge Graph

Ontology

How Baker Hughes Turned Four Years of Fragmented Plant Data Into Root-Cause Answers

Engineers at Baker Hughes built a knowledge graph on PuppyGraph and a team of AI agents to diagnose the equipment failures dragging down Overall Equipment Efficiency across four industrial plants. They asked questions in plain language. The agents wrote the graph queries.

203,845 nodes • 204,684 edges

One connected graph spanning four plants and five datasets.

5 datasets, zero ETL

Unified directly on Databricks Delta tables, with no data movement.

4 years of history (2019–2022)

Incidents, RCAs, work orders, and bad actor records, all queryable in natural language.

‍

At a glance

Organization: Baker Hughes (Houston, Texas)
Sector: Energy and industrial equipment
Use case: OEE improvement, equipment reliability, automated root-cause analysis
Data platform: Databricks (Delta Lake)
Graph layer: PuppyGraph (virtual, zero-ETL)
Query languages: openCypher and Gremlin
Agent LLM: databricks-claude-sonnet-4
Source: SPE-228135-MS

About this case study: PuppyGraph prepared this summary from a technical paper published by the Baker Hughes team. Every figure and technical detail below is drawn from that public paper. To read the full methodology and results, see the original paper: SPE-228135-MS.

The challenge: the answers existed, but no one could reach them fast

Overall Equipment Efficiency (OEE) measures how well a manufacturing operation runs against its full potential, combining three factors: availability (the share of scheduled time equipment is ready to run), performance (how fast it runs versus its designed capacity), and quality (the proportion of good output). A single OEE score points to where productivity is being lost.

The hard part is not the metric. It is finding out why the score is low. As the Baker Hughes team describes it, diagnosing poor OEE means pulling together predictive maintenance records, work orders, timesheets, root cause analyses (RCAs), and event logs. That data is fragmented and siloed across departments and systems. Much of it is unstructured, sitting in free-text maintenance notes and incident reports. It is undermined by sensor errors and manual entry mistakes, and locked inside legacy systems never built for modern analytics.

Working through those large, unstructured datasets by hand is slow, complicated, and error-prone, which delays both identifying and resolving OEE issues. And because multiple factors interact, the real underlying cause is often obscured.

Why the team chose PuppyGraph

The team needed a graph to connect these scattered records, but without standing up and maintaining a separate graph database.

"Although Neo4j is a popular knowledge graph solution, we used PuppyGraph due to its open-source flexibility and seamless integration capabilities."

In the paper's description, PuppyGraph functions as a zero-ETL graph analytics engine that queries data directly from sources like Databricks, without requiring data movement, which reduces onboarding costs and improves performance. It supports openCypher and Gremlin and is compatible with major cloud providers including AWS, Azure, and GCP.

The architectural difference that mattered here: the graph is not stored in a static format. PuppyGraph builds it virtually on top of existing data, such as the Delta tables already in Databricks. Data stays where it lives. PuppyGraph connects to Databricks, maps the tables to nodes and edges, and the schema can be defined either interactively in the UI or by uploading a JSON file.

Figure 1—Schema loaded in PuppyGraph showing connections between plant, functional location, incident, RCA, bad actor, and work order history datasets. Note that the current version of PuppyGraph UI displays only a single node per dataset type, however in reality there are a total of 203,845 nodes and 204,684 edges in our knowledge graph.

How they built it

Five datasets were combined across four plants and four years of operations (2019–2022): incident logs, RCA reports, bad actor analysis, work order history, and functional location details. Each one carries a distinct view of plant health:

Incident logs capture unplanned downtime events, time-stamped and categorized by loss type, annotated with production loss (in metric tons), financial loss (in USD), and a text description of what happened.
RCA reports add root-cause context for high-priority incidents, with reasons, severity ratings, and corrective or preventive actions.
Bad actor records flag equipment that fails repeatedly and drives high production loss and maintenance cost, with the cost of unreliability and resolution status.
Work orders document maintenance activity, including execution time, hours worked, and functional location.

The connective tissue is the functional location field, a common identifier that links a failure in the incident log to its RCA entry, its related work orders, and any repeat appearances on the bad actor list. Modeled as a hub-and-spoke graph with functional location at the center, the result is a single connected knowledge graph of 203,845 nodes and 204,684 edges, in effect a digital twin of the plants spanning equipment specifications, performance history, failure records, and work order details.

On top of that graph, the team deployed a multi-agent AI workflow. The central capability is text-to-Cypher: each agent converts a natural-language question into a Cypher query, runs it against the graph, and returns an answer. The agents are specialized, including plant-level, asset-level, incident, RCA, work order, and bad actor agents, plus a summarizer agent that condenses the findings and a visualization agent that writes and runs Python to generate plots. Each agent is guided by a structured prompt and returns its reasoning in THINKING, PLAN, and CONCLUSION sections, so the path from question to answer is auditable. In production, the workflow can be triggered automatically whenever an anomaly (such as a significant drop in production quantity) is logged to the graph.

What the agents found

Each figure below is a reliability loss the agents surfaced and quantified on their own, from a single plain-language question, across records that previously had to be read by hand.

Availability, from a single question. Asked about production losses between July and September 2021, the work order agent analyzed the work orders from that window and returned the most problematic equipment, the most affected functional locations, and the common maintenance types (most were external visual inspections), along with recommended next steps for each.

High-loss equipment, ranked by plant. From a single natural-language instruction:

"For each plant, can you give me the top 3 categories of equipment types that led to highest RCA losses? Break down these losses by their corresponding incident ratings."

the RCA agent aggregated across the RCA, incident, and asset datasets and ranked the results. Plant C surfaced as the highest-risk site, with over USD 71 million in RCA-related losses, led by compressors, fired pressure vessels, and unfired heat exchangers. Across all four plants, compressors and pressure vessels were consistently the most significant contributors.

Bad actors, by cost of unreliability. A second prompt:

"For each plant, can you provide the number of bad actors categorized by equipment type along with their associated cost of unreliability?"

Plant C again stood out with 18 recurring underperforming assets and roughly USD 532 million in cost of unreliability. Ranked across plants, compressors (USD 351 million) and fired equipment (USD 272 million) dominated the total.

The throughline: a few asset classes, compressors and pressure vessels chief among them, drive a disproportionate share of unplanned downtime and reliability cost. That is exactly the kind of prioritization an OEE program needs, and the agents produced it from plain questions rather than weeks of manual analysis.

What's next

The Baker Hughes team frames this as a foundation. Their stated next steps include extending the system toward prescriptive analytics, integrating additional data modalities such as images, and strengthening security and access controls. The broader thesis: pairing agentic AI with a robust, extensible knowledge graph makes root-cause analysis scalable, explainable, and fast enough to keep pace with how plants actually run.

Read the full research

This case study summarizes work published by the Baker Hughes team (N. Sengar, A. Jain, R. Elsinga, P. Rai, and A. Anand) as SPE-228135-MS, presented at the SPE Annual Technical Conference and Exhibition, Houston, 2025. All figures and technical details above are drawn from the public paper.

Read it here: https://doi.org/10.2118/228135-MS

Try it yourself

Download the forever-free PuppyGraph Developer Edition, or book a demo with our graph experts today.