Importing data from the file system

The tools restore command creates the items of the database schema in the database, and populates them with the data previously exported there with the tools dump command or prepared manually as per the rules from the File structure article:

ydb [connection options] tools restore -p PATH -i PATH [options]

where [connection options] are database connection options

If the table already exists in the database, no changes will be made to its schema. If some columns present in the imported files are missing in the database or have mismatching types, this may lead to the data import operation failing.

To import data to the table, use the YQL REPLACE command. If the table included any records before the import, the records whose keys are present in the imported files are replaced by the data from the file. The records whose keys are absent in the imported files aren't affected.

Required parameters

-p PATH or --path PATH: Path to the database directory the data will be imported to. To import data to the root directory, specify .. All the missing directories along the path will be created.

-i PATH or --input PATH: Path to the directory in the client system the data will be imported from.

Optional parameters

[options]: Optional parameters of the command:

--restore-data VAL: Enables/disables data import, 1 (yes) or 0 (no), defaults to 1. If set to 0, the import only creates items in the schema without populating them with data. If there's no data in the file system (only the schema has been exported), it doesn't make sense to change this option.

--restore-indexes VAL: Enables/disables import of indexes, 1 (yes) or 0 (no), defaults to 1. If set to 0, the import won't either register secondary indexes in the data schema or populate them with data.

--dry-run: Matching the data schemas in the database and file system without updating the database, 1 (yes) or 0 (no), defaults to 0. When enabled, the system checks that:

  • All tables in the file system are present in the database
  • These items are based on the same schema, both in the file system and in the database

--save-partial-result: Save the partial import result. If disabled, an import error results in reverting to the database state before the import.

Workload restriction parameters

Using the below parameters, you can limit the import workload against the database.

Attention!

Some of the below parameters have default values. This means that the workload will be limited even if none of them is mentioned in tools restore.

--bandwidth VAL: Limit the workload per second, defaults to 0 (not set). VAL specifies the data amount with a unit, for example, 2MiB.
--rps VAL: Limits the number of queries used to upload batches to the database per second, the default value is 30.
--in-flight VAL: Limits the number of queries that can be run in parallel, the default value is 10.
--upload-batch-rows VAL: Limits the number of records in the uploaded batch, the default value is 0 (unlimited). VAL determines the number of records and is set as a number with an optional unit, for example, 1K.
--upload-batch-bytes VAL: Limits the batch of uploaded data, the default value is 512KB. VAL specifies the data amount with a unit, for example, 1MiB.
--upload-batch-rus VAL: Applies only to Serverless databases to limit Request Units (RU) that can be consumed to upload one batch, defaults to 30 RU. The batch size is selected to match the specified value. VAL determines the number of RU and is set as a number with an optional unit, for example, 100 or 1K.

Examples

Note

The examples use the quickstart profile. To learn more, see Creating a profile to connect to a test database.

Importing to the database root

From the current file system directory:

ydb -p quickstart tools restore -p . -i .

From the current file system directory:

ydb -p quickstart tools restore -p . -i ~/backup_quickstart

Uploading data to the specified directory in the database

From the current file system directory:

ydb -p quickstart tools restore -p dir1/dir2 -i .

From the current file system directory:

ydb -p quickstart tools restore -p dir1/dir2 -i ~/backup_quickstart

Matching schemas between the database and file system:

ydb -p quickstart tools restore -p dir1/dir2 -i ~/backup_quickstart --dry-run