Open Source · Self-Hosted · Private by Design

Turn Your File Storage into a Private AI Knowledge Base

ZettaBrain brings AI-powered document search to your own infrastructure — no cloud, no API keys, no data leaving your server. Start with a single user or deploy for your whole organisation.

No API keys required
No data leaves your server
Works with existing NFS / SMB storage
Runs on CPU or GPU
Backed by: NVIDIA Inception Partner · AWS Activate Program · Google for Startups · GitHub Startups Program · Datadog Startup Program
Two Products

Private AI for Individuals and Organisations

ZettaBrain RAG for individuals and small teams. ZettaBrain Teams for organisations that need per-team isolation, Active Directory SSO, and audit logging.

Open Source · Self-Hosted

ZettaBrain RAG

Ask questions in plain English, get sourced answers drawn from your own files — running entirely on your own server. Every response shows the exact document chunks it was drawn from so you can verify the source.

  • Ingest PDF, DOCX, TXT, and Markdown files
  • Five-stage hybrid retrieval (keyword + semantic + re-ranking)
  • Secure HTTPS web interface and interactive CLI
  • Works with local disk, NFS, SMB, and S3-compatible storage
  • Auto-detects NVIDIA, AMD, and Apple Silicon GPUs
  • Powered by Ollama — llama3, mistral, qwen run locally
  • No API keys, no cloud accounts, no egress costs
  • One-line install — up and running the same day
curl -fsSL https://zettabrain.app/install.sh | sudo bash
Read the Docs →
Open Source · Self-Hosted · Multi-Tenant

ZettaBrain Teams

Multi-tenant RAG server for organisations. Each team gets its own isolated document library with strict data boundaries — combined with Active Directory SSO, audit logging, and a full admin interface.

  • Per-team document isolation (vector store + BM25)
  • Active Directory / LDAP single sign-on
  • Hybrid retrieval — semantic + keyword + FlashRank re-ranking
  • Web interface for end users and admin panel for IT
  • Audit logging with CSV export
  • Import AD users directly into teams
  • Powered by Ollama — llama3, mistral, qwen run locally
  • No API keys, no cloud accounts, no egress costs
pipx install zettabrain-teams && sudo zettabrain-teams-setup
Read the Docs →
Retrieval Architecture

How ZettaBrain Answers Your Questions

Every query passes through a five-stage pipeline that combines keyword and semantic search before sending only the most relevant context to the local LLM.

01 📂 Document Ingestion: PDF, DOCX, TXT, MD from local disk, NFS, SMB, or S3
02 ✂️ Adaptive Chunking: Chunk size tuned per document type and text density
03 🔢 Local Embedding: nomic-embed-text runs via Ollama, with no cloud calls
04 🔍 Hybrid Search: BM25 keyword + MMR semantic, merged and deduplicated
05 🏅 Re-Ranking + LLM: FlashRank cross-encoder picks best chunks before the LLM responds
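
Stage 04 merges the keyword and semantic candidate lists and removes duplicates before re-ranking. The sketch below shows one way such a merge could work, assuming each retriever returns (chunk_id, text) pairs in rank order. The name `merge_candidates` and the interleaving order are illustrative assumptions, not ZettaBrain's actual code.

```python
from itertools import zip_longest
from typing import Iterable

Chunk = tuple[str, str]  # (chunk_id, text); a hypothetical shape for this sketch


def merge_candidates(keyword_hits: Iterable[Chunk],
                     semantic_hits: Iterable[Chunk]) -> list[Chunk]:
    """Interleave two ranked lists, keeping the first occurrence of each chunk_id."""
    seen: set[str] = set()
    merged: list[Chunk] = []
    for pair in zip_longest(keyword_hits, semantic_hits):
        for hit in pair:
            # zip_longest pads the shorter list with None
            if hit is None or hit[0] in seen:
                continue
            seen.add(hit[0])
            merged.append(hit)
    return merged


keyword_hits = [("c1", "Clause 14.2 termination"), ("c7", "Notice period")]
semantic_hits = [("c3", "Ending the agreement early"), ("c1", "Clause 14.2 termination")]
merged = merge_candidates(keyword_hits, semantic_hits)
print([chunk_id for chunk_id, _ in merged])
```

Chunk c1 is returned by both retrievers but appears only once in the merged list, so the re-ranker does not score the same text twice.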
BM25 KEYWORD SEARCH
Exact Term Matching
Catches precise phrases, codes, and names that pure vector search can miss — especially useful in legal and compliance documents.
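
To see why keyword scoring catches exact codes and names, here is a minimal, standard-library sketch of Okapi BM25 scoring. The whitespace tokenizer, the `k1` and `b` defaults, and the sample documents are assumptions chosen for illustration; ZettaBrain's own BM25 configuration is not shown on this page.

```python
import math
from collections import Counter


def bm25_scores(query: str, docs: list[str],
                k1: float = 1.5, b: float = 0.75) -> list[float]:
    """Score each document against the query with the Okapi BM25 formula."""
    if not docs:
        return []
    tokenized = [d.lower().split() for d in docs]
    n = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n
    # Document frequency: in how many documents each term appears
    df = Counter(term for toks in tokenized for term in set(toks))
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            idf = math.log((n - df[term] + 0.5) / (df[term] + 0.5) + 1.0)
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


docs = [
    "section 409a deferred compensation rules",
    "general guidance on employee compensation",
    "office parking policy",
]
scores = bm25_scores("409a compensation", docs)
print(scores)
```

The first document ranks highest because it contains the rare token "409a" exactly, which is the kind of match an embedding model may blur.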
MMR SEMANTIC SEARCH
Diversity + Relevance
Maximum Marginal Relevance via ChromaDB retrieves chunks that are topically relevant without being repetitive.
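
The trade-off between relevance and diversity can be sketched in a few lines. This is a standalone illustration of the Maximum Marginal Relevance selection rule using plain Python lists as vectors. It is not the ChromaDB call itself, and `mmr_select`, `lambda_mult`, and the toy vectors are assumptions made for the example.

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mmr_select(query_vec: list[float], cand_vecs: list[list[float]],
               k: int = 3, lambda_mult: float = 0.5) -> list[int]:
    """Return indices of up to k candidates, balancing relevance and novelty."""
    selected: list[int] = []
    remaining = list(range(len(cand_vecs)))

    def mmr_score(i: int) -> float:
        relevance = cosine(query_vec, cand_vecs[i])
        # Penalise similarity to anything already selected
        redundancy = max((cosine(cand_vecs[i], cand_vecs[j]) for j in selected),
                         default=0.0)
        return lambda_mult * relevance - (1 - lambda_mult) * redundancy

    while remaining and len(selected) < k:
        best = max(remaining, key=mmr_score)
        selected.append(best)
        remaining.remove(best)
    return selected


query = [1.0, 0.0]
candidates = [
    [1.0, 0.10],   # highly relevant
    [1.0, 0.12],   # near-duplicate of the first candidate
    [0.8, -0.60],  # less relevant, but covers a different direction
]
picked = mmr_select(query, candidates, k=2)
print(picked)
```

With these toy vectors the second pick is the third candidate rather than the near-duplicate, even though the near-duplicate is closer to the query.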
CROSS-ENCODER RE-RANKING
FlashRank Scoring
ms-marco-MiniLM-L-12-v2 scores every candidate chunk against the actual query. Only the best context reaches the model.
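
The re-ranking step follows a simple pattern: score every candidate chunk against the query, sort, and keep the top few. The sketch below shows that pattern with a pluggable scoring function. The toy term-overlap scorer is only a stand-in so the example runs without downloading a model; in a real pipeline the scorer would be the cross-encoder. All names here are hypothetical.

```python
from typing import Callable


def rerank(query: str, chunks: list[str],
           score_fn: Callable[[str, str], float], top_k: int = 3) -> list[str]:
    """Score each chunk against the query and return the top_k best chunks."""
    scored = [(score_fn(query, chunk), chunk) for chunk in chunks]
    # Sort on score only; ties keep their original order
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk for _, chunk in scored[:top_k]]


def toy_overlap_score(query: str, chunk: str) -> float:
    """Stand-in scorer: fraction of query terms that appear in the chunk."""
    query_terms = set(query.lower().split())
    chunk_terms = set(chunk.lower().split())
    return len(query_terms & chunk_terms) / len(query_terms) if query_terms else 0.0


chunks = [
    "parking permits are issued annually",
    "the notice period for termination is 30 days",
    "termination requires written notice",
]
top = rerank("termination notice period", chunks, toy_overlap_score, top_k=2)
print(top)
```

Keeping the scorer as a parameter means the selection logic stays the same whichever model produces the scores.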
TRANSPARENT SOURCING
See Where Answers Come From
Every answer shows the document chunks it was drawn from. Type "sources" in the CLI at any time to inspect them.
Who It's For

Built for Teams That Can't Send Files to the Cloud

ZettaBrain is useful for any organization that manages sensitive documents and needs a way to search and query them — without routing data through a third-party API.

⚖️

Legal & Compliance

Law firms and in-house legal teams work with confidential contracts, case files, and regulatory documents. ZettaBrain lets staff ask plain-language questions across all of it — without sending a single file to a cloud AI service.

Contract review · Case research · Regulatory Q&A
🏥

Healthcare

Clinical teams and administrators manage sensitive patient documentation, clinical guidelines, and compliance records that cannot be processed by external AI services. ZettaBrain runs entirely inside your own infrastructure.

Clinical protocols · Policy manuals · Staff knowledge base
🏦

Financial Services

Banks, asset managers, and insurers need fast access to regulatory filings, internal policies, and client documentation — and they need that access to stay inside their security perimeter.

Regulatory filings · Risk documentation · Audit trails
🏛️

Government & Defense

Government agencies and defense organizations often operate in air-gapped or restricted environments where cloud AI tools are simply not an option. ZettaBrain can run with no outbound internet connectivity at all.

Air-gapped deploy · Internal policy search · Offline operation
🔬

Research & Education

Research institutions accumulate large bodies of papers, lab reports, and internal documentation. ZettaBrain helps researchers find answers across their own corpus without depending on generic public AI tools.

Literature search · Lab documentation · Grant documents
🏢

Enterprise IT & Operations

IT and operations teams maintain large volumes of runbooks, architecture docs, and vendor manuals on shared drives. ZettaBrain turns that NFS or SMB share into a knowledge base your team can query in plain English.

Runbooks · Architecture docs · Incident response
What Makes It Different

Private by Design, Practical by Default

Most document AI tools require cloud APIs and managed services. ZettaBrain is built around the opposite assumption.

🔒

No Cloud Dependency

The embedding model, the LLM, and the vector store all run on your machine. No OpenAI keys, no Anthropic keys, no egress charges — and no risk of your documents appearing in someone else's training data.

🗄️

Works With Storage You Already Have

ZettaBrain connects directly to your NFS mounts or SMB shares. Your files stay exactly where they are — there's no migration, no upload step, no new storage platform to manage.

📋

Answers Come With Sources

Every response includes the specific document chunks it was based on. Users can check the original file — important for legal, compliance, and any context where accuracy needs to be verifiable.

⚙️

Straightforward to Install

A one-line installer handles OS detection, Python, Ollama, and model download on Linux, macOS, and Windows. Most teams are up and running the same day.

💻

Flexible Hardware Requirements

Runs on CPU-only hardware (a smaller model is recommended), and accelerates automatically when NVIDIA, AMD, or Apple Silicon GPUs are detected. The wizard recommends the right model for what you have.

Platform Compatibility

Runs Where Your Infrastructure Lives

One installer — detects your OS and package manager automatically. No manual steps for each platform.

Ubuntu

Ubuntu 22.04+ — full GPU support, systemd service, apt-based install

Red Hat Linux

RHEL 8 / 9 / 10 — DNF / YUM package manager, systemd service

macOS

Apple Silicon (M1/M2/M3) and Intel — Metal GPU acceleration

Windows

Windows 10/11 and Server 2016+ — PowerShell + winget

🖥️

GPU Acceleration

NVIDIA CUDA, AMD ROCm, Apple Silicon Metal — auto-detected

Storage: Local disk · NFS · SMB/Samba · AWS S3 · MinIO · Ceph

Our Team

The People Behind ZettaBrain

Built by engineers with deep roots in enterprise storage and AI infrastructure at Amazon Web Services, NetApp, and Hewlett-Packard.

Olajide Shobowale
Founder & CEO

A globally recognized storage systems engineer whose career spans Amazon Web Services, NetApp, and Hewlett-Packard. Olajide has built breakthrough storage technologies serving millions of users worldwide — from intelligent data tagging solutions to hybrid storage optimizations that delivered significant cost reductions and efficiency gains for enterprise customers.

He is the author of "Intelligent Data Management And Security" and a recognized thought leader in major cloud storage technical communities. He has received multiple industry awards for technical excellence and leadership, and holds advanced degrees in Computing and Forensics Information Technology.

Enterprise Storage · Cloud Infrastructure · AWS · NetApp · Fortune 500
Tife Obideyi
Co-Founder & CTO

A seasoned full-stack engineer who bridges complex storage infrastructure with enterprise usability. At ZettaBrain, Tife architects the user-facing layer — real-time performance monitoring dashboards, resource allocation interfaces, and management consoles that make sophisticated storage technology accessible to both technical teams and decision-makers.

Expertise in React, TypeScript, and GraphQL, with a track record in large-scale monorepo migrations, CI/CD pipeline optimization, and data-intensive visualization. Certified AWS Solutions Architect with hands-on experience in secure system design and PCI DSS compliance.

React / TypeScript · GraphQL · AWS Solutions Architect · Cybersecurity · PCI DSS
Bolarinwa Shobowale
Co-Founder & Head of Business Development

Bolarinwa brings over a decade of commercial experience across enterprise technology sales, brand marketing, and procurement spanning the UK and Nigeria. He is currently a Product Advisor at Microsoft UK, where he leads needs-led demonstrations of Microsoft devices and cloud services and guides customers through digital adoption. Earlier roles at Samsung Electronics UK and Russell & Bromley sharpened his ability to translate complex products into clear, customer-led value, which is directly relevant to taking ZettaBrain to enterprise teams.

At Brandlife Nigeria he planned and executed go-to-market campaigns for HP, Intel, and Microsoft across in-store and digital channels. As Procurement Manager at DesignHQ he ran end-to-end vendor relationships and negotiated cost savings of up to 10% on supplier contracts. Holds an MSc in International Business from the University of Chester and a BSc in Business Administration from Redeemer's University.

Enterprise Sales · Go-to-Market · Microsoft · Brand Marketing · MSc International Business
Backed By

Supported by Industry Leaders

ZettaBrain is recognized by leading technology startup programs, giving us access to world-class infrastructure, expertise, and go-to-market support.

NVIDIA Inception
Member of NVIDIA's Inception Program for AI startups, providing deep learning resources and ecosystem access.
AWS Activate
AWS Activate member — cloud credits, technical support, training, and go-to-market collaboration from Amazon Web Services.
Google for Startups
Google Cloud for Startups member with cloud infrastructure credits and access to Google's startup support network.
GitHub Startups
GitHub Startups Program member with access to GitHub's developer tooling, collaboration infrastructure, and community.
Datadog Startups
Datadog Startup Program member with monitoring, observability, and infrastructure analytics support for our platform.

Ready to Get Started?

Install ZettaBrain on your own server and start querying your documents today, or reach out and we'll walk you through it.

ZettaBrain RAG

Private Document AI

ZettaBrain lets you have natural language conversations with your own documents — running entirely on your own infrastructure, with no API keys and no data sent to any cloud service.

What It Does

It connects to your existing file storage (local disk, NFS, SMB, or S3-compatible), indexes your documents locally, and lets you query them in plain English through a web interface or CLI. Every answer includes the source chunks it came from.

Supported Platforms

| Platform | Installer | Notes |
| --- | --- | --- |
| Ubuntu | install.sh | systemd service, full GPU support |
| Red Hat Linux | install.sh | DNF / YUM, RHEL 8 / 9 / 10 |
| macOS (Apple Silicon / Intel) | install.sh | Homebrew, Metal GPU |
| Windows 10/11 / Server 2016+ | install.ps1 / install.cmd | PowerShell, winget |

ZettaBrain Teams

Multi-Tenant Private Document AI

ZettaBrain Teams is a self-hosted, multi-tenant RAG server. Each team in your organisation gets its own isolated document library, users authenticate via Active Directory, and everything runs on your own infrastructure — no cloud, no API keys, no data egress.

What It Does

ZettaBrain Teams ingests documents per team and lets each team query its own documents through a shared web interface. The admin panel manages users, teams, and AD integration. Every answer includes the source chunks it came from, and every query is logged to the audit trail.
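
As a rough picture of what an audit trail with CSV export involves, the sketch below records one row per query and serialises the rows with Python's standard `csv` module. The field names follow the audit-logging description in the feature list (user, team, confidence, duration); the `AuditRecord` class, the `duration_ms` unit, and `export_csv` are assumptions for illustration and not ZettaBrain Teams' real schema or API.

```python
import csv
import io
from dataclasses import asdict, dataclass, fields


@dataclass
class AuditRecord:
    """One logged query (hypothetical schema)."""
    user: str
    team: str
    confidence: float
    duration_ms: int


def export_csv(records: list[AuditRecord]) -> str:
    """Serialise audit records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=[f.name for f in fields(AuditRecord)])
    writer.writeheader()
    for record in records:
        writer.writerow(asdict(record))
    return buffer.getvalue()


log = [
    AuditRecord(user="alice", team="legal", confidence=0.82, duration_ms=1430),
    AuditRecord(user="bob", team="hr", confidence=0.64, duration_ms=980),
]
csv_text = export_csv(log)
print(csv_text)
```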

Key Features

| Feature | Detail |
| --- | --- |
| Per-team isolation | Separate ChromaDB collection and BM25 index per team; no cross-boundary leakage |
| Active Directory SSO | LDAP bind-search-rebind with optional group membership enforcement |
| Hybrid retrieval | MMR vector search + per-team BM25 + FlashRank cross-encoder re-ranking |
| Audit logging | Every query logged with user, team, confidence, duration; exportable as CSV |
| Admin panel | Create users, manage teams, pull Ollama models, configure AD, view stats |
| REST API | FastAPI; all admin and chat operations available as JSON endpoints |
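
Per-team isolation comes down to keying every index by team and never searching across keys. The sketch below illustrates the idea with in-memory lists standing in for the per-team ChromaDB collection and BM25 index. `TeamIndexRegistry` and its methods are hypothetical names, not part of the ZettaBrain Teams API.

```python
from collections import defaultdict


class TeamIndexRegistry:
    """Keeps one document store per team so a query can only see its own team's data."""

    def __init__(self) -> None:
        self._docs: dict[str, list[str]] = defaultdict(list)

    def add(self, team: str, document: str) -> None:
        self._docs[team].append(document)

    def search(self, team: str, term: str) -> list[str]:
        """Substring search restricted to a single team's documents."""
        needle = term.lower()
        # .get avoids creating an empty entry for unknown teams
        return [doc for doc in self._docs.get(team, []) if needle in doc.lower()]


registry = TeamIndexRegistry()
registry.add("legal", "Master services agreement, clause 14")
registry.add("hr", "Parental leave policy")

legal_hits = registry.search("legal", "clause")
hr_hits = registry.search("hr", "clause")
print(legal_hits, hr_hits)
```

The HR query returns nothing for "clause" because the lookup never touches the legal team's store, which is the boundary the table describes.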

Get in Touch

Request a Briefing or Call

Tell us about your team's document workflows and what you're trying to solve. We're happy to walk you through ZettaBrain and discuss whether it fits your setup.

Quick Response

We aim to reply within one business day.

📧

Email Us Directly

info@zettabrain.io

💻

Open Source

ZettaBrain RAG is free and open source. View on GitHub →

🖥️

NVIDIA Inception Partner

ZettaBrain is a member of the NVIDIA Inception program.

Send a Message

Fields marked * are required. Your information is never shared with third parties.

By submitting, your message will be sent to info@zettabrain.io.

Engineering Blog

Ideas from the ZettaBrain Team

Deep dives into private AI, multi-tenant RAG, and building production-grade document intelligence — on-prem.

RAG Multi-Tenant BM25

Per-Tenant BM25 in Hybrid RAG: How We Fixed Cross-Boundary Document Leakage

A shared BM25 index poisons IDF scores across tenant boundaries even when your vector store is correctly isolated. Here's how we caught it and fixed it with per-team BM25 indices.

Olajide Shobowale  ·  May 2026 Read →
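
The effect described in this post can be reproduced with a toy corpus. The sketch below computes the BM25 inverse document frequency of one term twice: once over a single team's documents and once over a shared index that also contains another team's documents. It is an illustration of the general mechanism with made-up data, not the code from the post.

```python
import math


def idf(term: str, docs: list[str]) -> float:
    """BM25-style inverse document frequency of a term over a corpus."""
    n = len(docs)
    df = sum(1 for doc in docs if term in doc.lower().split())
    return math.log((n - df + 0.5) / (df + 0.5) + 1.0)


team_a = [
    "merger agreement draft",
    "merger due diligence checklist",
    "merger escrow terms",
]
team_b = [
    "holiday rota",
    "expense policy",
    "laptop refresh schedule",
]

idf_isolated = idf("merger", team_a)        # per-team index
idf_shared = idf("merger", team_a + team_b)  # one index for both teams
print(round(idf_isolated, 3), round(idf_shared, 3))
```

Team B's documents never appear in team A's results, yet their presence in the shared index changes how much weight "merger" carries in team A's rankings. A per-team index keeps that statistic inside the tenant boundary.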