AI Case Studies | Steelhead, Calgary

Internal Tooling

Building a Private Knowledge Graph That Actually Answers Questions

A local-first memory system that ingests your whole document pile, maps the people and companies inside it, and answers questions with citations back to the source files.

1,400+

Documents ingested and searchable locally

Read full case study →

AI Proof of Concept

Automated Buy-Side Intelligence for M&A Deal Sourcing

A data platform that ingests public filings and web sources, resolves them into a unified entity database, and matches firms against deal parameters with a 1 to 100 fit score.

3,190

Deduplicated firms from 8,000+ raw entities

Read full case study →

Functional Prototype

Centralized Intelligence Platform for Venture Capital

A multi-agent platform that automates deal sourcing, company research, investment thesis generation, and founder outreach from a single dashboard.

5-7 min

Investment thesis generation, down from 4-8 hours

Read full case study →

Internal Tooling

Building a Private Knowledge Graph That Actually Answers Questions

The Challenge

Knowledge workers build up a mountain of documents over time. Pitch decks, reports, emails, meeting notes, spreadsheets, screenshots. After a couple of years, there are thousands of files. Folders stop working. Search misses anything that was worded differently than you remember. The information you need exists somewhere in the pile, but you cannot find it in the thirty seconds before a call.

Off-the-shelf tools do not fix this. Notion and Claude Projects do not automatically pull in new files from your inbox or Drive. Enterprise tools like Glean cost thousands per seat and require a vendor to customize. ChatGPT with file uploads forgets everything the moment you close the chat.

What was needed was a private memory system that runs on your own machine, ingests your whole document pile automatically, and lets you ask questions that pull from across all of it.

What We Built

A local system that watches a folder, pulls in every new document automatically, and organizes the contents into a searchable memory. You ask a question in plain language and get an answer that draws from across your entire document pile, with links back to every source it used.

Knowledge Graph application home screen with natural language query input and retrieval mode selector — Ask a question in plain language. The system searches across all of your documents at once.

Every document is scanned for the important stuff: the people, companies, tools, and projects it mentions. Those get organized into a graph so you can see at a glance that three different documents all reference the same person, or that five projects all depend on the same vendor.

Force-directed cross-document entity graph showing document nodes connected by shared people, organizations, tools, and technologies — Every document and the people, companies, and tools inside it, mapped visually.

When you ask a question, the system searches two ways at once. It searches for documents that match the meaning of your question, and it searches for documents that contain the exact words. Then it combines both results and gives you an answer with links back to the source documents so you can verify what it said.

Everything runs locally. No data leaves the machine except for the AI calls that read and summarize text, and those never touch the raw files during search. The system handles plain-language questions about risks, decisions, people, or any topic that appears across the document pile, and answers with citations attached to every claim.

Hybrid retrieval query returning structured risk analysis with inline source citations and timing data — Ask about risks, decisions, or anyone mentioned across the whole pile.

The same approach works for any business drowning in documents: staffing agencies, insurance brokerages, and marketing agencies all deal with the same problem at different scales.

Results

1,400+

Documents ingested and searchable

4,700+

People, companies, and projects extracted into the graph

7,000+

Connections mapped between documents

~$0.02

Cost per document to extract entities

< 1 sec

Typical query response time

100%

Source attribution on every answer

Python FastAPI LanceDB Next.js Gemini Embeddings Claude API Ollama

AI Proof of Concept