
The Misunderstood World of Knowledge Graphs

July 21, 2025
5 min read
Vishakha Gupta

I think in terms of graphs. I see a person and I retrieve what I know about them as key-value attributes. I see connections among the entities in my life, like how I know a person or where I met them, literally as imaginary lines drawn from A to B to C and so on. For the longest time I thought everyone was like that, and that we were being forced to map our connected brain objects to the relational world because of the popularity of SQL! In the last few years I have realized, shockingly, that the group of people who think in terms of graphs is not very large. Our brains have generally been molded to be tabular! In fact, even though graphs are everywhere, from social networks to recommendation engines, they remain one of the most misunderstood data paradigms. While AI and connected systems cry out for structure, context, and semantics, we continue to force relationships into flat tables and rigid joins. Why did graphs, and more importantly graph databases, become so confusing, and how do we make them simple?

Key Terms in the Graph Data World

Let’s start with decoding some foundational terminology:

  • Knowledge Graphs are a construct that lets us organize and reason over structured information about entities and their relationships, often to enable semantic understanding. For example, if person A is married to B and C manages B’s finances, we can infer that C is probably managing A’s finances as well.
  • Property Graphs allow us to represent entities (e.g. person A, person B) and the relationships among them (e.g. A is married to B) as nodes and edges, both of which can hold key-value pairs, providing a way to build a knowledge graph.
  • Triple Stores are systems that store data as subject-predicate-object triples, the basic unit of RDF-based systems. This approach makes them generic and standardized, and also facilitates the creation of knowledge graphs.
  • Ontology defines a formal structure of domain concepts and relationships, acting as a schema for semantic understanding.
  • Taxonomy classifies information hierarchically, helping organize entities based on shared characteristics.
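
To make the first three terms concrete, here is a minimal sketch in plain Python. It is not any particular database's API: the `Node` and `Edge` classes, the relationship names, and the `infer_household_finances` rule are all hypothetical, chosen only to mirror the married-couple example above. It shows a tiny property graph, the same data flattened into triples, and a toy inference rule of the kind a knowledge graph enables.

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    id: str
    label: str
    props: dict = field(default_factory=dict)

@dataclass
class Edge:
    src: str
    rel: str
    dst: str
    props: dict = field(default_factory=dict)

# A tiny property graph: nodes and edges both carry key-value pairs.
nodes = [
    Node("A", "Person", {"name": "A"}),
    Node("B", "Person", {"name": "B"}),
    Node("C", "Person", {"name": "C"}),
]
edges = [
    Edge("A", "married_to", "B", {"since": 2015}),
    Edge("C", "manages_finances_of", "B"),
]

def to_triples(nodes, edges):
    """Flatten the property graph into subject-predicate-object triples."""
    triples = []
    for n in nodes:
        triples.append((n.id, "type", n.label))
        for key, value in n.props.items():
            triples.append((n.id, key, value))
    for e in edges:
        # Edge properties do not fit in a plain triple, so this sketch drops them.
        triples.append((e.src, e.rel, e.dst))
    return triples

def infer_household_finances(edges):
    """Toy rule: if X manages Y's finances and Y is married to Z,
    suggest (not prove) that X also manages Z's finances."""
    spouses = {}
    for e in edges:
        if e.rel == "married_to":
            spouses.setdefault(e.src, set()).add(e.dst)
            spouses.setdefault(e.dst, set()).add(e.src)
    inferred = []
    for e in edges:
        if e.rel == "manages_finances_of":
            for spouse in sorted(spouses.get(e.dst, ())):
                if spouse != e.src:
                    inferred.append(
                        Edge(e.src, "manages_finances_of", spouse, {"inferred": True})
                    )
    return inferred

triples = to_triples(nodes, edges)
inferred = infer_household_finances(edges)
```

Note that the inferred edge is tagged `inferred: True` rather than treated as fact, since a rule like this is only as good as the domain knowledge behind it.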

Graphs Are Easy, So Why Are We Still Pushing Knowledge into Tables?

Relational databases made sense in the age of forms, invoices, and bank ledgers, but when today’s systems must understand language, images, events, and their interconnected contexts, tabular formats start to crack. Graphs represent relationships natively, and align more naturally with how human and machine intelligence perceive the world. 

Still, myths about graphs persist and I feel personally responsible to bust them.

⚠ Misconception 1: Relational Schemas Are Flexible; Graph Schemas Need Preparation

Many believe relational schemas offer agility, and that graph schemas are rigid or overcomplicated. The reality? Graphs allow emergent structure; relationships and entities can evolve without destructive migrations. Schema-less or schema-light graph databases let you build incrementally while preserving semantic richness. Flexibility isn’t just possible, it’s native. In fact, we have occasionally run into the opposite problem: people who understand graphs and have dabbled in graph databases find them too flexible and want restrictions built into the graph loaders. That is easy to accomplish as well.
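
As a rough illustration of both sides of that point, the sketch below shows a schema-light loader that accepts new node and edge types without any migration, plus an optional allow-list for teams that want restrictions. The `GraphLoader` class and its `allowed_edges` parameter are hypothetical names for this example, not part of any real graph database.

```python
class GraphLoader:
    """In-memory loader; pass allowed_edges to opt into restrictions."""

    def __init__(self, allowed_edges=None):
        # allowed_edges: optional set of (source label, relationship, target label).
        # None means "anything goes", which is the schema-light default.
        self.allowed_edges = allowed_edges
        self.nodes = {}   # node id -> (label, properties)
        self.edges = []   # (source id, relationship, target id, properties)

    def add_node(self, node_id, label, **props):
        # New labels and new properties need no migration step.
        self.nodes[node_id] = (label, props)

    def add_edge(self, src, rel, dst, **props):
        if src not in self.nodes or dst not in self.nodes:
            raise KeyError("both endpoints must exist before adding an edge")
        if self.allowed_edges is not None:
            signature = (self.nodes[src][0], rel, self.nodes[dst][0])
            if signature not in self.allowed_edges:
                raise ValueError(f"edge type not allowed: {signature}")
        self.edges.append((src, rel, dst, props))

# Flexible by default: a brand new relationship type is simply accepted.
open_loader = GraphLoader()
open_loader.add_node("p1", "Person", name="Kara")
open_loader.add_node("d1", "Document", title="Spec")
open_loader.add_edge("p1", "authored", "d1")

# Restricted on request: only Person-friend_of-Person edges may be loaded.
strict = GraphLoader(allowed_edges={("Person", "friend_of", "Person")})
strict.add_node("p1", "Person")
strict.add_node("d1", "Document")
try:
    strict.add_edge("p1", "authored", "d1")
    rejected = False
except ValueError:
    rejected = True
```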

🐱 Misconception 2: Graph Databases Are Slow

“Graphs are too slow for production” is a dated view that stems from early implementations, especially Java-based ones and those relying on disk-based traversals. Today it is very much possible to build memory-optimized graph databases, as ApertureDB does, that achieve speed with:

  • In-memory data structures and cache-optimized traversals with a memory-first design
  • Index-free adjacency to enable fast traversal over connections to explore neighborhoods of broader information
  • Parallel query execution

And with hybrid graph+vector systems, semantic search and reasoning can be near real-time, even across multimodal data (we have measured 15 ms lookup times on a billion-scale graph with ApertureDB).
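
Index-free adjacency is easier to see than to describe. In the sketch below, each node keeps direct references to its neighbors, so exploring a neighborhood means following pointers rather than consulting a global index or joining tables. This is a toy in-memory model with made-up names (`Node`, `neighborhood`), and it makes no claim about how ApertureDB or any other engine lays out its data.

```python
from collections import deque

class Node:
    __slots__ = ("name", "out")

    def __init__(self, name):
        self.name = name
        self.out = []   # direct references: (relationship, neighbor Node)

    def connect(self, rel, other):
        self.out.append((rel, other))

def neighborhood(start, max_hops):
    """Breadth-first walk over stored references.
    Returns {node name: number of hops from start}."""
    seen = {start.name: 0}
    queue = deque([(start, 0)])
    while queue:
        node, hops = queue.popleft()
        if hops == max_hops:
            continue   # do not expand past the hop limit
        for _rel, nxt in node.out:
            if nxt.name not in seen:
                seen[nxt.name] = hops + 1
                queue.append((nxt, hops + 1))
    return seen

a, b, c, d = (Node(n) for n in "ABCD")
a.connect("knows", b)
b.connect("knows", c)
c.connect("knows", d)

result = neighborhood(a, 2)   # D is three hops away, so it is not reached
```

The cost of each step depends on how many edges the current node has, not on the total size of the graph, which is the property that makes deep traversals practical.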

📈 Misconception 3: Graph Databases Don’t Scale

Scale is often equated with horizontal sharding and billions of rows, but graph databases, when built well, can easily handle:

  • Billions of entities and edges
  • Distributed storage and compute, especially when graph patterns and edge distributions are correctly managed
  • Query patterns optimized for traversal

Modern graph platforms can scale contextually, connecting depth and breadth without sacrificing performance.
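
One way to see why edge distribution matters for distributed graphs: every edge that crosses a partition boundary is a potential network hop during traversal. The sketch below compares two hand-written placements of the same six nodes on two machines and counts the cross-partition edges for each. The graph, the placements, and the `cut_edges` helper are invented for illustration; real systems use far more sophisticated partitioning.

```python
def cut_edges(edges, placement):
    """Count edges whose endpoints live on different partitions."""
    return sum(1 for src, dst in edges if placement[src] != placement[dst])

# Two tightly knit triangles (a, b, c) and (x, y, z) joined by one bridge edge.
edges = [
    ("a", "b"), ("b", "c"), ("a", "c"),
    ("x", "y"), ("y", "z"),
    ("c", "x"),
]

# Placement that ignores graph structure (alternating assignment).
naive = {"a": 0, "b": 1, "c": 0, "x": 1, "y": 0, "z": 1}

# Placement that keeps each cluster together.
clustered = {"a": 0, "b": 0, "c": 0, "x": 1, "y": 1, "z": 1}

naive_cut = cut_edges(edges, naive)
clustered_cut = cut_edges(edges, clustered)
```

Here the structure-aware placement cuts only the single bridge edge, while the naive one cuts most of the graph.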

Common Questions Surrounding Graph Construction

Even as graphs gain traction, many users still ask: Where do I start? What goes into a graph? Can I build one automatically? Let’s tackle the most common questions we hear.

📩 What Goes in my Knowledge Graph?

At its core, a graph stores entities (nodes) and relationships (edges). But what qualifies?

  • Entities: Person, product, document, image, sensor, event - anything with identity. 
  • Relationships: Friend-of, part-of, located-in, derived-from, similar-to, embeddings-of - anything that connects or contextualizes.

You can also attach properties or attributes to both nodes and edges: timestamps, labels, and more - basically, any metadata about whatever is being represented in the graph. The goal is to capture structure and semantics, not just raw data. Most graph databases make it easy to search by the type of entity and value of attributes e.g. “find any person who was born after 2000” or by their relationship to each other e.g. “find all friends-of Kara”.
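
The two example searches above can be sketched in a few lines of plain Python. The data, the relationship name `friend_of`, and the helper functions are hypothetical; a real graph database would expose its own query language or API for the same operations.

```python
# Nodes keyed by id; every node has a label plus arbitrary properties.
nodes = {
    "kara": {"label": "Person", "name": "Kara", "born": 1998},
    "liam": {"label": "Person", "name": "Liam", "born": 2003},
    "mina": {"label": "Person", "name": "Mina", "born": 2001},
    "cam1": {"label": "Sensor", "name": "Door camera"},
}

# Edges as (source, relationship, target).
edges = [
    ("liam", "friend_of", "kara"),
    ("mina", "friend_of", "kara"),
    ("mina", "friend_of", "liam"),
]

def find_nodes(nodes, label, predicate):
    """Search by entity type and attribute value."""
    return sorted(
        node_id
        for node_id, props in nodes.items()
        if props["label"] == label and predicate(props)
    )

def find_sources(edges, rel, target):
    """Search by relationship: who points at `target` via `rel`?"""
    return sorted(src for src, r, dst in edges if r == rel and dst == target)

# "Find any person who was born after 2000"
born_after_2000 = find_nodes(nodes, "Person", lambda p: p["born"] > 2000)

# "Find all friends-of Kara"
friends_of_kara = find_sources(edges, "friend_of", "kara")
```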

🧩 Users Already Have Graph Data

Here’s the twist: most users already have graph-shaped data, they just don’t call it that.

  • A customer table linked to orders? That’s a graph.
  • A document referencing other documents? Graph.
  • A multimodal dataset with images, captions, and tags? Definitely a graph.

The challenge isn’t creating graphs from scratch, it’s recognizing and unlocking the graph that’s already there. Once you map the existing data to a graph, you can keep adding new data to the graph database as it becomes available, like with any other database. Over time, with the added relations and entities, you start to assemble your "knowledge" base, which can then be queried for deeper insights, e.g. whether a person has a big family, answered by counting the family relations originating from that person's node in the graph.
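
Here is a small sketch of the customer-and-orders case: the rows become nodes, the foreign key becomes an explicit edge, and a question like "how many orders per customer" (or, equally, "how big is this person's family") becomes a count of edges leaving a node. The table contents, the `placed` relationship name, and both helper functions are assumptions made for this example.

```python
from collections import Counter

# Two relational tables, as lists of rows.
customers = [{"id": 1, "name": "Ada"}, {"id": 2, "name": "Grace"}]
orders = [
    {"id": 10, "customer_id": 1, "total": 25.0},
    {"id": 11, "customer_id": 1, "total": 40.0},
    {"id": 12, "customer_id": 2, "total": 15.5},
]

def tables_to_graph(customers, orders):
    """Rows become nodes; the customer_id foreign key becomes an edge."""
    nodes, edges = {}, []
    for row in customers:
        nodes[("Customer", row["id"])] = {"name": row["name"]}
    for row in orders:
        nodes[("Order", row["id"])] = {"total": row["total"]}
        edges.append(
            (("Customer", row["customer_id"]), "placed", ("Order", row["id"]))
        )
    return nodes, edges

def out_degree(edges, rel):
    """Count edges of one relationship type leaving each node."""
    return Counter(src for src, r, _dst in edges if r == rel)

nodes, edges = tables_to_graph(customers, orders)
orders_per_customer = out_degree(edges, "placed")
```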

🤖 Can You Generate a Graph for Me?

Yes, and increasingly, large language models (LLMs) and other AI tools can help automate graph construction.

  • From structured data: infer entities and relationships from tables, logs, or APIs.
  • From unstructured data: extract concepts and links from text, images, or video.
  • From multimodal sources: unify disparate modalities into a coherent graph.

Overall, LLMs are becoming powerful allies in graph construction by assisting with:

  • Entity extraction: Identify key concepts from text and other data types.
  • Relationship inference: Predict how entities relate based on context.
  • Schema suggestion: Recommend graph structures based on domain knowledge.
  • Query generation: Translate natural language into graph queries.

LLMs don’t replace graph design, they accelerate and augment it. Automated construction can be tricky to get right, though: entity resolution, for instance, is a major area of research in itself.
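
To give a feel for where entity resolution comes in, the sketch below takes a hard-coded list standing in for what an LLM extractor might return and merges surface forms that refer to the same thing before building the graph. Everything here is an assumption for illustration: the `extracted` records, the `TITLES` set, and the very naive `canonical` normalization, which a real pipeline would replace with something far more robust.

```python
import re

# Hypothetical extractor output; a real pipeline would get this from an LLM call.
extracted = [
    {"subject": "Dr. Kara Lee", "relation": "works_at", "object": "Acme Corp"},
    {"subject": "kara lee", "relation": "authored", "object": "Graph Primer"},
    {"subject": "ACME Corp.", "relation": "located_in", "object": "Austin"},
]

TITLES = {"dr", "mr", "ms", "mrs", "prof"}

def canonical(name):
    """Naive resolution key: lowercase, strip punctuation, drop titles."""
    tokens = re.sub(r"[^\w\s]", "", name.casefold()).split()
    return " ".join(t for t in tokens if t not in TITLES)

def build_graph(triples):
    """Merge mentions by canonical key, then add one edge per relation."""
    nodes = {}     # canonical key -> set of surface forms seen in the text
    edges = set()  # (subject key, relation, object key)
    for t in triples:
        subj, obj = canonical(t["subject"]), canonical(t["object"])
        nodes.setdefault(subj, set()).add(t["subject"])
        nodes.setdefault(obj, set()).add(t["object"])
        edges.add((subj, t["relation"], obj))
    return nodes, edges

nodes, edges = build_graph(extracted)
```

Without the `canonical` step, "Dr. Kara Lee" and "kara lee" would become two disconnected people, and the graph would quietly lose the link between where she works and what she wrote.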

🔄 Can I Map Any Objects in This Graph?

Yes, graphs are schema-flexible and modality-agnostic. You can map:

  • Text, images, audio, video
  • Structured records and logs
  • Embeddings and model outputs
  • External APIs and knowledge bases

If it has identity and context, it can live in a graph. The key is defining them in a manner that matters to your application.

A note on what it means to map multimodal objects like text, images, PDFs, audio, video and so on: typically, in databases like ApertureDB, it means introducing a node that represents those data types and linking to the corresponding files in storage. You can of course choose to embed the content as an attribute of the node but that requires some accessibility optimizations. However, with all the metadata captured in the graph node representing the multimodal object, it's quite feasible to search fast and access them efficiently and at scale. It also opens up the possibility of representing derived objects like bounding boxes for images, clips or frames for videos, or paragraphs for documents as connected nodes in the graph of multimodal data.
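
A minimal sketch of that idea, using plain Python dictionaries rather than any real database API: the image node stores metadata and a pointer to the file, bounding boxes are separate nodes connected to it, and a search touches only the metadata. The storage URI, property names, and `derived_from` relationship are all made up for this example.

```python
nodes = {}   # node id -> properties (including a "label")
edges = []   # (source id, relationship, target id)

def add_node(node_id, label, **props):
    nodes[node_id] = {"label": label, **props}

def add_edge(src, rel, dst):
    edges.append((src, rel, dst))

# The image node holds metadata and a link to the file, not the pixels.
add_node("img-001", "Image",
         uri="s3://example-bucket/img-001.jpg", width=1920, height=1080)

# Derived objects are their own nodes, connected back to the source image.
add_node("box-1", "BoundingBox", x=100, y=80, w=300, h=400, tag="person")
add_node("box-2", "BoundingBox", x=900, y=500, w=200, h=150, tag="dog")
add_edge("box-1", "derived_from", "img-001")
add_edge("box-2", "derived_from", "img-001")

def images_containing(tag):
    """Find image URIs by bounding-box tag using graph metadata only."""
    hits = set()
    for src, rel, dst in edges:
        if (rel == "derived_from"
                and nodes[src].get("tag") == tag
                and nodes[dst]["label"] == "Image"):
            hits.add(nodes[dst]["uri"])
    return sorted(hits)

dog_images = images_containing("dog")
```

Only after the search narrows things down to a handful of URIs does the application need to fetch any actual image data from storage.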

Clearing the Air, Simplifying Graphs

When you need a large team to build a wannabe graph on top of relational or key-value databases (remember TAO on MySQL?), or when your data fits naturally in a graph but you are forced to spend resources generating relational tables from it to save folks from dealing with graphs, it raises the question: why not migrate to a graph database and invest in removing these misconceptions about them?

Graphs aren't exotic. They’re just data plus context. And as AI systems become more perceptual, relational, and multimodal, graph-native thinking will be key.

It’s not about replacing relational databases. It’s about knowing when and where to shift paradigms, from flat to dimensional, from records to relationships, from storage to contextual memory.
