YDB — Beyond Distributed SQL Database

Combine ACID transactions, fault tolerance, and limitless scalability with OLTP, OLAP, and streaming workloads in a single universal database for mission-critical applications.

Quick Start | Docs for YDB v25.1 | Editions

Bringing AI to Developers

Vector Similarity Search

A native vector index for approximate k-nearest-neighbor queries lets you build semantic search, recommendations, and RAG pipelines. It is not bound by memory and scales to billions of embeddings.
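To make the idea of a nearest-neighbor query concrete, here is a minimal sketch of exact (brute-force) k-nearest-neighbor search by cosine distance. It is not YDB's vector index: an approximate index returns similar results without scanning every embedding. The document ids and three-dimensional vectors are hypothetical toy data.

```python
import math

def cosine_distance(a, b):
    """Return 1 - cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)

def nearest_neighbors(query, embeddings, k):
    """Exact k-NN: rank every stored embedding by distance to the query."""
    ranked = sorted(
        embeddings,
        key=lambda item_id: cosine_distance(query, embeddings[item_id]),
    )
    return ranked[:k]

# Hypothetical toy embeddings keyed by document id.
docs = {
    "cats": [0.9, 0.1, 0.0],
    "dogs": [0.8, 0.3, 0.0],
    "cars": [0.0, 0.2, 0.9],
}
top2 = nearest_neighbors([1.0, 0.0, 0.0], docs, k=2)
```

The brute-force scan costs one distance computation per stored embedding, which is the cost an approximate index is designed to avoid at large scale.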

LLM-Ready SQL Queries

Use a familiar SQL dialect to query structured or JSON data, either writing queries manually or delegating them to your favorite LLM through the Model Context Protocol (MCP).

YDB MCP Server | YQL reference

Built-in AI Assistant

Get context-aware advice on administering and querying YDB clusters.

Transactional Workloads

ACID Transactions and Strong Consistency

Global ACID transactions without inconsistencies, lost updates, or undesired stale reads. No need to handle obscure edge cases caused by consistency anomalies on the application side.
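As an illustration of the "lost update" anomaly mentioned above, the following sketch contrasts an unchecked read-modify-write with a version-checked one. It is a generic toy model, not a description of how YDB implements transactions; `VersionedCell` and its methods are hypothetical names.

```python
class VersionedCell:
    """A single value with a version counter, standing in for one row."""

    def __init__(self, value):
        self.value = value
        self.version = 0

    def read(self):
        return self.value, self.version

    def write_unchecked(self, value):
        # No conflict detection: the last writer silently wins.
        self.value = value
        self.version += 1

    def write_if_unchanged(self, value, expected_version):
        # Reject the write if another writer committed since our read.
        if self.version != expected_version:
            return False
        self.value = value
        self.version += 1
        return True

# Two clients both read a balance of 100 and each try to add 10.
unsafe = VersionedCell(100)
a, _ = unsafe.read()
b, _ = unsafe.read()
unsafe.write_unchecked(a + 10)
unsafe.write_unchecked(b + 10)  # overwrites the first update

safe = VersionedCell(100)
a, va = safe.read()
b, vb = safe.read()
first_ok = safe.write_if_unchanged(a + 10, va)
second_ok = safe.write_if_unchanged(b + 10, vb)  # stale version: rejected
if not second_ok:
    # The second client re-reads and retries on fresh data.
    b, vb = safe.read()
    second_ok = safe.write_if_unchanged(b + 10, vb)
```

In the unchecked case one of the two increments disappears. In the checked case the conflicting write is rejected and succeeds only after a retry, so both increments are kept.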

Learn more on transactions

Scalability

Add or remove nodes online to adapt to workload changes. Data is automatically sharded and transparently rebalanced, with compute and storage scaling independently.
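One common way to shard data automatically is to keep rows sorted by key and split a key range when it grows too large. The sketch below shows that general technique under simple assumptions (integer keys, a row-count threshold). The class name and threshold are hypothetical, and YDB's actual split criteria are not described here.

```python
import bisect

class RangeShardedTable:
    """Rows kept sorted by key and split into contiguous key-range shards."""

    def __init__(self, max_rows_per_shard):
        self.max_rows = max_rows_per_shard
        self.boundaries = []  # boundaries[i] is the first key of shard i + 1
        self.shards = [[]]    # each shard is a sorted list of keys

    def shard_index(self, key):
        # Keys below boundaries[0] route to shard 0, and so on.
        return bisect.bisect_right(self.boundaries, key)

    def insert(self, key):
        i = self.shard_index(key)
        bisect.insort(self.shards[i], key)
        if len(self.shards[i]) > self.max_rows:
            self._split(i)

    def _split(self, i):
        # Cut the oversized shard in half and record the new boundary key.
        rows = self.shards[i]
        mid = len(rows) // 2
        self.shards[i:i + 1] = [rows[:mid], rows[mid:]]
        self.boundaries.insert(i, rows[mid])

table = RangeShardedTable(max_rows_per_shard=4)
for key in range(1, 11):
    table.insert(key)
```

Because routing only needs the sorted list of boundary keys, a split changes where new requests go without touching rows in other shards.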

Change Data Capture

Subscribe to a real-time stream of updates to row-oriented tables and process or react to them elsewhere. Multiple data formats and subscription options are available.
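A consumer of such a change stream typically replays each record against its own copy of the data. The sketch below assumes a JSON record shape with a `key` array plus either an `update` object or an `erase` marker. That shape is an assumption for illustration, so check the CDC documentation for the exact formats your changefeed emits.

```python
import json

def apply_change(replica, raw_record):
    """Apply one change record to an in-memory replica keyed by primary key."""
    record = json.loads(raw_record)
    key = tuple(record["key"])
    if "erase" in record:
        replica.pop(key, None)
    elif "update" in record:
        # Merge changed columns into the existing row, creating it if needed.
        replica.setdefault(key, {}).update(record["update"])
    return replica

# Hypothetical stream of change records, in commit order.
stream = [
    '{"key": [1], "update": {"name": "Alice", "city": "Oslo"}}',
    '{"key": [2], "update": {"name": "Bob"}}',
    '{"key": [1], "update": {"city": "Rome"}}',
    '{"key": [2], "erase": {}}',
]
replica = {}
for raw in stream:
    apply_change(replica, raw)
```

After replaying the four records, the replica holds one row for key 1 with the updated city, and the row for key 2 is gone.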

Learn more on CDC

Columnar Storage

Column-oriented tables deliver fast analytics on fresh data. Same transactions and consistency level as with row-oriented tables.

MPP Vectorized Query Engine

State-of-the-art distributed execution planner and optimizer scales analytical workloads to petabytes.

Federated Queries

Fetch data from multiple external sources in a single YDB query. Copy it to YDB storage or process it on the fly.

Kafka-Compatible Streaming

YDB Topics and Kafka API 3.4.0

Persistent queues with exactly-once delivery and auto-partitioning. Reuse existing Kafka clients and tools without changing the driver, or develop with the native YDB SDK.
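Exactly-once writes are commonly achieved by having each producer number its messages so the log can drop retries. The sketch below shows that general deduplication technique. `DedupPartition`, the producer id, and the sequence numbering from 1 are hypothetical, and this is not a description of YDB Topics internals.

```python
class DedupPartition:
    """Append-only log that drops retried messages by (producer_id, seq_no)."""

    def __init__(self):
        self.log = []
        self.last_seq = {}  # producer_id -> highest sequence number accepted

    def append(self, producer_id, seq_no, payload):
        if seq_no <= self.last_seq.get(producer_id, 0):
            return False  # duplicate from a retry: already stored
        self.log.append(payload)
        self.last_seq[producer_id] = seq_no
        return True

partition = DedupPartition()
results = [
    partition.append("producer-1", 1, "a"),
    partition.append("producer-1", 2, "b"),
    partition.append("producer-1", 2, "b"),  # retry after a lost acknowledgement
    partition.append("producer-1", 3, "c"),
]
```

The retried message is acknowledged as already stored instead of being appended twice, so the log contains each payload once.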

More on YDB Topics | Kafka API Reference

Topic-Table Transactions

YDB’s unified architecture allows reliable data movement between topics and tables in any direction with transactional guarantees.

More on YDB Transactions

Topic-to-Table Data Ingestion

Automatically and reliably ingest data from topics into tables for long-term storage without using external tools. Supports ingestion either within a single YDB database or between databases.


Enterprise Ready

Enterprise-Grade Security

YDB’s security features help meet industry-specific compliance requirements such as PCI DSS, international standards such as SOC 2 and ISO 27001, government requirements, and the expectations of even the strictest internal security teams.

Security Documentation

Multiple Deployment Options

For on-premises environments, YDB can be deployed with Ansible or purchased bundled with hardware as an appliance. For cloud environments, Kubernetes is also an option.

Ansible | Kubernetes | Contact Sales about Hardware Appliances

Multitenancy

YDB supports running multiple isolated databases in a single cluster with a shared storage layer, as well as managing the resources allocated to different workloads within the same database.

Workload Management (in Russian)

Observability and Backups

YDB integrates with common enterprise observability software via industry-standard protocols like Prometheus and OpenTelemetry. Backups are typically written to S3-compatible object stores or to any filesystem.

YDB Observability | YDB Backups

Three Availability Zones

Maximum fault tolerance with up to 99.99% availability. Data remains available for reads and writes even if one availability zone and one server rack in another zone are fully unavailable simultaneously.
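To put the availability figure in perspective, this small calculation converts an availability percentage into the downtime it permits. The helper name is hypothetical and the arithmetic assumes a 365-day year.

```python
def allowed_downtime_minutes(availability_percent, days=365):
    """Minutes of downtime permitted over `days` at the given availability."""
    unavailable_fraction = 1 - availability_percent / 100
    return unavailable_fraction * days * 24 * 60

per_year = allowed_downtime_minutes(99.99)
per_30_days = allowed_downtime_minutes(99.99, days=30)
```

At four nines this works out to roughly 52.6 minutes per year, or about 4.3 minutes in a 30-day month.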

Two Availability Zones

Access your data reliably even when a third availability zone is not feasible for your business.

One Availability Zone

Erasure coding uses half the disk space of three replicas while providing the same fault tolerance. Achieve the lowest latencies by avoiding cross-datacenter traffic.
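The "half the disk space" claim can be checked with simple arithmetic. The sketch below assumes an erasure-coding layout of 4 data parts plus 2 parity parts. The text above does not name the scheme, so treat those numbers as an illustrative assumption. Both layouts survive the loss of any two parts.

```python
def storage_overhead(data_parts, redundancy_parts):
    """Raw bytes stored per byte of user data."""
    return (data_parts + redundancy_parts) / data_parts

replication = storage_overhead(1, 2)  # three full copies of the data
erasure = storage_overhead(4, 2)      # assumed 4 data + 2 parity parts
ratio = erasure / replication
```

Three replicas store 3 bytes per byte of user data, the assumed 4+2 layout stores 1.5, and the ratio of the two is one half.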

Single Node

Perfect for functional testing and prototyping.

Latest YDB Release v25.1

YDB v25.1 brings approximate vector search, enhanced Apache Kafka compatibility, and consistent cross-cluster asynchronous replication.

Read more | Roadmap

Top-Tier Results on Industry-Standard Performance Benchmarks

TPC-C

The TPC-C benchmark simulates complex OLTP workloads for transactional database systems, modeling a wholesale supplier’s order-entry environment with multiple transaction types and inventories. Performance is measured in New-Order transactions per minute (tpmC).

ClickBench

ClickBench evaluates analytical DBMS performance by simulating high-volume clickstream workloads with OLAP queries, aggregations, and filters, providing throughput, latency, and resource usage metrics for tuning.

Streaming

YDB Topics can provide throughput exceeding that of specialized systems such as Apache Kafka and Apache Pulsar.

Results (in Russian)

Recent Talks About YDB

Designing YDB: Constructing a Distributed Cloud-Native DBMS for OLTP and OLAP from the Ground Up
Evgenii Ivanov at FOSSASIA 2025

Choose Your YDB Edition

Open-Source

Complete source code on GitHub under the Apache 2.0 license. Community-supported. Self-host anywhere.

Quick Start | GitHub

Enterprise

Enhanced security, compliance, and 24×7 SLA-backed commercial support.

Read More (in Russian)

Cloud

Managed YDB service on Yandex Cloud infrastructure with serverless and dedicated deployment options.
