xorq-labs/xorq

[Xorq logo]

[License · PyPI version · CI status]

The periodic table for ML computation.

Everything is an expression. Addressable. Composable. Portable.

Write high-level expressions. Execute them as SQL on DuckDB, Snowflake, BigQuery, or any other engine. Every computation is addressable, versioned, and reusable.

Documentation · Discord · Website


What is Xorq?

Machine learning (ML) infrastructure is fragmented—features in one system, models in another, lineage reconstructed through archaeology.

What if features, models, and pipelines aren't different things?

A feature is a computation. A model is a computation. A pipeline is computations composed. The vendor categories aren't computational truths—they're commercial territories. Strip away the product boundaries and everything reduces to the same primitive: the expression.

Xorq is the composability layer for compute expressed as relational plans.

Installation

pip install "xorq[examples]"
xorq init -t penguins

Full tutorial

Quickstart

import xorq.api as xo
from xorq.caching import ParquetStorage
from sklearn.ensemble import RandomForestClassifier

data = xo.read_parquet('s3://bucket/penguins.parquet')
train, test = xo.test_train_splits(data, test_size=0.2)

model = xo.Pipeline.from_instance(RandomForestClassifier())
fitted = model.fit(train, features=['bill_length_mm', 'bill_depth_mm'],
                   target='species')

predictions = fitted.predict(test).cache(storage=ParquetStorage())  # deferred, nothing runs yet
predictions.execute()  # now the work happens

CLI:

# Build the `predictions` expression from expr.py into a manifest under builds/.
xorq build expr.py -e predictions

# Run the built manifest.
xorq run builds/

How it works

Xorq captures your ML computation as an input-addressed manifest—a declarative representation where each node is identified by the hash of its computation specification, not its results.

# Manifest snippet: fit → predict lineage
predicted:
  op: ExprScalarUDF            # Model inference
  kwargs:
    bill_length_mm: ...        # Feature inputs
    bill_depth_mm: ...
  meta:
    __config__:
      computed_kwargs_expr:    # Training lineage preserved
        op: AggUDF             # Model training
        kwargs:
          species: ...         # Original training target

What This Enables

Capability            How
--------------------  ------------------------------------------------------
Version by intent     Same computation = same hash, regardless of input data
Precise caching       Cache based on what you're computing, not when
Structural lineage    Provenance is the graph itself, not reconstructed logs
Portable execution    Manifest compiles to optimized SQL for any engine
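
Portable execution follows from the expression being engine-agnostic: the same logic can be bound to different backends. Below is a minimal sketch, assuming an Ibis-style DuckDB connector is exposed as xo.duckdb.connect() (that constructor name is an assumption; the expression API follows the quickstart above).

import xorq.api as xo

# Assumed Ibis-style backend constructor, shown for illustration only.
con = xo.duckdb.connect()
penguins = con.read_parquet('penguins.parquet')

# The same deferred expression could be bound to Snowflake or BigQuery instead;
# executing it compiles the plan to that engine's SQL.
summary = (
    penguins
    .group_by('species')
    .agg(mean_bill=penguins.bill_length_mm.mean())
)
summary.execute()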

Input-Addressing

Every computation gets a unique hash based on its logic:

  • Same feature engineering on different days → same hash (reusable)
  • Different feature logic → different hash (new version)

If anyone on your team has run this exact computation before, Xorq reuses it automatically. The hash is the truth.
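
A minimal sketch of what that reuse looks like, using only pieces from the quickstart above (data and ParquetStorage); where the cache lives is up to the storage backend:

# Same logic, written on different days or by different people:
features_v1 = data.select('bill_length_mm', 'bill_depth_mm').cache(storage=ParquetStorage())
features_v2 = data.select('bill_length_mm', 'bill_depth_mm').cache(storage=ParquetStorage())

features_v1.execute()  # computes once and writes the cache
features_v2.execute()  # identical logic, identical hash: read back from the cache

# Different logic, different hash, so this is a new version with its own cache entry:
features_v3 = data.select('bill_length_mm').cache(storage=ParquetStorage())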

The Catalog

Your team's shared ledger of ML compute: versioned, discoverable, composable. Below is an example of registering a built expression in the catalog and working with it.

# Register a build with an alias.
❯ xorq catalog add builds/7061dd65ff3c --alias fraud-model

# Discover what exists.
❯ xorq catalog ls
Aliases:
fraud-model                  7061dd65ff3c     r2
customer-features            dbf90860-88b3    r1
recommendation-pipeline      52f987594254     r1

# Trace lineage.
❯ xorq lineage fraud-model

# Serve for inference.
❯ xorq serve-unbound fraud-model --port 8001 405154f690d20f4adbcc375252628b75

The catalog isn't a database. It's an addressing system—discoverable by humans, navigable by agents.

The Architecture

[Architecture diagram]

Learn more

Status

Pre-1.0. Expect breaking changes with migration guides.