Skip to content
tatami

CLI reference

Every tatami command and flag.

tatami [command] [flags]

The CLI inspects, reads, converts, and catalogs .tatami files. Run tatami <command> --help for the canonical, up-to-date list.

tatami inspect

tatami inspect <file>

Prints a header and footer summary: the format version, row and group counts, the role (document store or search segment), the compressed and uncompressed sizes, and a per-column line with the encoding, codec, value and null counts, and any index structures (bloom filter, blob region, sort key). It reads only the header and footer, so it is instant on a file of any size.

tatami cat

tatami cat <file> [flags]

Streams the rows of a file to standard output as JSONL, one object per line.

Flag Default Meaning
--columns all Comma-separated columns to project
--limit 0 Stop after this many rows (0 = no limit)

Projecting a subset of columns reads only those columns off disk. A blob-separated body column is read only if you project it.

tatami convert

tatami convert <in.parquet> <out.tatami> [flags]

Re-encodes a producer Parquet shard as tatami. Reads the Parquet leaf schema, maps each column to a tatami logical type, applies the layout heuristics below, and streams the rows in bounded memory. Prints the size both ways when it finishes.

Flag Default Meaning
--blob markdown,body,html Comma-separated columns to separate into the blob region
--bloom doc_id,url,digest Comma-separated columns to build a membership filter on
--dict all other strings Comma-separated string columns to hint toward the shared dictionary
--batch 4096 Rows to read and append at a time (0 = default)

Passing an empty list (--bloom "") disables that heuristic; omitting a flag keeps its default.

tatami collection

tatami collection [command]

Manages the tatami.manifest catalog over a directory of files. See managing a collection for the workflow.

tatami collection add

tatami collection add <dir> <file.tatami>...

Catalogs one or more files into the collection rooted at <dir>, recording each file's key range and zone-statistic rollup so a query can prune it before opening it.

tatami collection list

tatami collection list <dir>

Lists the live members of the collection, each with its key span (if it has a sort key) and the columns the manifest can prune on.

tatami collection compact

tatami collection compact <dir>

Rolls the append-only manifest log into a fresh one containing only the live set, dropping accumulated add-and-remove churn. The swap is atomic (write a temp file, rename it into place).

tatami version

tatami --version

Prints the version, the commit it was built from, and the build date.