Inference & ServingJun 02, 2026
Cutting inference cost on data-warehouse workloads
How a routing trick and smarter batching dropped a customer's per-row inference cost by 62%, with the numbers to back it up.
by Anika Patel
Engineering posts, changelogs, and contributions from the community. Written by people who ship.
How a routing trick and smarter batching dropped a customer's per-row inference cost by 62%, with the numbers to back it up.
A practical guide to going from warehouse rows to a fine-tuned model without writing a single ETL job by hand.
Spot H100s in three regions, an Iceberg connector, eval harness v2, and a leaner CLI. The May changelog.
Tomás Becker walks through the RAG pipeline he runs on Datavere for a healthcare workload, evals included.
From `pip install` to a trained model in under twenty minutes, with the full project on GitHub.
Sixty-three builders, three talks, a lot of pizza. Highlights, slides, and the open mic recordings.