For AI labs
All modalities Voice Robotics Expert evaluation Multimodal & text
For partners
Join the network How partners earn Apply
Live
VoiceTibetan Ü-Tsang pilot launched — Dharamshala RoboticsRobotics teleop cohorts expanding across SE Asia Expert EvalExpert evaluator network expanding across domains VoiceSardinian pilot kicked off via Cagliari + Sassari studios MultimodalCross-modality embodied-AI dataset scoping with frontier lab VoiceCebuano production cohort scaling Expert EvalStanding legal evaluator network — multi-jurisdiction
QUARTZ LABS · DATA INFRASTRUCTURE

The data layer for frontier AI.

Quartz Labs sources, annotates, and ships training data for the world's leading AI teams — voice in underrepresented languages, robotics teleop demonstrations, credentialed expert evaluation, multimodal RLHF. We specialize in cases off-the-shelf vendors can't cover.

View our work
0
Countries in our sourcing network
0
Vetted contributors across modalities
0
Continents covered for in-person sourcing
0
From scoping call to pilot quote
BUILT BY OPERATORS AND RESEARCHERS FROM
Caltech
Harvard Business School
Goldman Sachs
McKinsey & Company
Tsinghua University
D. E. Shaw & Co.
LinkedIn
Microsoft
Y Combinator
Broadcom
What we ship

Eight modalities and growing. One operational layer.

Built across modalities most vendors are too thin to support. Each shipped through the same coordinated network and the same operational rigor. If your modality isn't listed, ask — bespoke pipelines are the default, not the exception.

VOICE · LIVE
Multispeaker conversational audio with full transcripts in underrepresented languages.
Read about voice
ROBOTICS · ACTIVE
Teleop demonstrations sourced through SE Asia operator networks and credentialed teams.
Read about robotics
EXPERT EVALUATION · LIVE
Credentialed domain experts for high-stakes evaluation of frontier model outputs.
Read about eval
MULTIMODAL & TEXT · ON REQUEST
RLHF, instruction tuning, multilingual classification, cross-modality corpora. Bespoke pipelines.
Read about multimodal
CODE + LANGUAGE PAIRS · ON REQUEST
Non-English natural language to code mapping. Multilingual code models with regional dialect coverage.
Read about multimodal
VISION-LANGUAGE · ON REQUEST
Image-caption pairs and visual-instruction data in underrepresented languages and regional contexts.
Read about multimodal
AGENTIC WORKFLOW TRACES · ON REQUEST
Multi-step agent trajectories with human-verified intermediate decisions. For training and evaluating agent foundation models.
Read about multimodal
SYNTHETIC + VERIFICATION · ON REQUEST
Human-in-loop verification, curation, and grounding of synthetic datasets. Catches the failures synthetic alone can't see.
Read about multimodal

Deliverable

What a deliverable looks like.

Every bundle ships with multitrack data, structured timestamps, demographic metadata, and a signed consent log. Same operational rigor across every modality. Tap any tab to switch.

Sample bundle
Consent IDQLB-VOX-2026-04417
LanguageCebuano (ceb)
RegionCebu City, PH
Speakers2 · F/M
Duration00:00:12.4
Sample rate48 kHz · 16-bit
00:0000:0200:0400:0600:0800:1000:12
00:00.42SPK_01Maayong buntag. Asa man ta moadto karong adlawa, sa merkado o sa playa?
00:03.18SPK_02Adto ta sa merkado sa, dayon sa hapon sa playa. Init kaayo karon.
00:06.91SPK_01Sige. Magpalit pud ko og isda nga bag-o, para mag-sugba ta sa gabii.
00:10.04SPK_02Hala, sulayi ang tindahan ni Nang Lita. Pinakabaratuhon didto.
MULTITRACK WAV (48 kHz, 16-bit) · WORD-LEVEL TIMESTAMPS (JSON) ·
SPEAKER DIARIZATION · DEMOGRAPHIC METADATA · CONSENT LOG
Demonstration trace
Trace IDQLB-RBT-2026-00892
OperatorOP_HNI_047
TaskPick & place · ALOHA
CohortHanoi
Duration00:00:23.7
OutcomeSuccess
State log
724 samples
Action log
724 deltas
Cameras
wrist · top · side
Force tactile
enabled
Sample rate
30 Hz · synced
Schema
LeRobot v0.5
MP4 MULTI-CAM · STATE / ACTION JSON · GRIPPER FORCE LOG ·
OPERATOR DEMOGRAPHIC METADATA · CONSENT LOG
Evaluation
Eval IDQLB-EVL-2026-02314
DomainMedical · Cardiology
EvaluatorMD · Board-cert · 12 yrs
Model[redacted]
Prompt
"55-year-old patient presents with intermittent chest pain on exertion, family history of MI. Recommend initial workup and triage decision."
Clinical accuracy9 / 10
Safety / triage call10 / 10
Workup completeness7 / 10
Reasoning transparency8 / 10
Guideline alignment9 / 10
Risk of harmLow
Evaluator reasoning
Model correctly escalated for stress testing and ECG, cited ACC/AHA risk stratification appropriately. Missed asking about radiation pattern and recent NSAID use, which would refine differential. Triage decision is clinically defensible.
AUDIT TRAIL · NDA-WRAPPED · PROVENANCE VERIFIABLE ON REQUEST
RLHF preference
Preference IDQLB-RLHF-2026-11203
AnnotatorRT_VN_018 · ML domain
LanguageVietnamese
Prompt
Giải thích sự khác biệt giữa học có giám sát và học không giám sát cho một học sinh trung học.
Response A
Học có giám sát giống như học toán với đáp án ở cuối sách — bạn biết đáp án đúng và mô hình học từ các ví dụ có nhãn. Học không giám sát giống như sắp xếp lại tủ sách của bạn theo chủ đề — không có đáp án "đúng", mô hình tự tìm cấu trúc trong dữ liệu.
Response B
Học có giám sát sử dụng dữ liệu được gắn nhãn. Học không giám sát sử dụng dữ liệu không được gắn nhãn. Các thuật toán phổ biến bao gồm hồi quy, phân loại, phân cụm và giảm chiều dữ liệu.
Annotator rationale
Response A preferred for the high-school audience: uses concrete analogies (math homework, bookshelf) that map to lived experience, avoids unintroduced jargon. Response B is technically accurate but reads as adult ML curriculum.
Pair shownRandomized Annotation time94s Agreement3 / 3 raters

Every deliverable ships with a consent log ID linked to a signed release. Provenance verifiable on request.


How we work.

One operational layer across every modality. From scoping call to first delivery in days, not quarters.

01 — Specify
Specify.
One scoping call. We map modalities, scope, languages or regions, quality bar, rights model, and timeline.
Indicative pricing within 48 hours
02 — Source
Source.
Through our coordinated network of universities, studios, NGOs, operator networks, credentialed experts, and dataset partners.
Native-speaker or domain-expert only
03 — Annotate
Annotate.
AI first-pass plus human review. Structured timestamps, speaker tags, demographic metadata, full audit trail.
Per-row signed consent log
04 — Ship
Ship.
Cleared dataset to your bucket on your schema. Daily drops or batched delivery. Two-week off-ramp on every engagement.
S3 · GCS · SFTP
Read the full process

How we're different

The default vendor wasn't built for frontier data.

Off-the-shelf vendors are built for languages, modalities, and timelines the last AI cycle paid for. We're built for the cases the next one requires.

DimensionOff-shelf vendorQuartz Labs
Language coverage20 major world languagesUnderrepresented languages live, hundreds more quotable
ModalitiesVoice or text, rarely bothVoice + robotics + expert eval + multimodal
Ethical sourcingMass-scraped, opaque provenancePer-row signed consent log, verifiable
Pilot to quote4-week scoping minimum48-hour quote, 14-day kickoff
Commitment6-month minimum, exclusivityTwo-week off-ramp, no exclusivity
Industry consensus

The data is the bottleneck.

Frontier-AI organizations have publicly named the same problem we're built to solve.

"Models will not be the bottleneck for AI progress. Data will be."
ALEXANDR WANGFOUNDER · SCALE AI
"The next frontier requires high-quality, specialized, expert data we don't yet have at scale."
OPENAIEXPERT NETWORK LAUNCH · 2024
"Underrepresented languages remain a fundamental gap in foundation models — and the gap is widening."
META AI — NO LANGUAGE LEFT BEHINDRESEARCH PAPER · 2022
Cases

Work we've shipped.

One live case to date. New cases land as each engagement matures past production milestones.

Ship data no one else can.

Pilot scope in 48 hours. Two-week off-ramp on every engagement.