# aprender — Next-generation ML framework in pure Rust


## Quick Start

```sh
cargo install aprender
apr pull qwen2.5-coder-1.5b
apr run qwen2.5-coder-1.5b "What is 2+2?"
```

## What is Aprender?

Aprender is a complete ML framework built from scratch in Rust: one `cargo install`, one `apr` binary, and 57 commands covering the full ML lifecycle:

| Stage | Commands | What it does |
|---|---|---|
| Inference | `apr run`, `apr chat`, `apr serve` | Run models locally (GGUF, SafeTensors, APR) |
| Training | `apr finetune`, `apr train`, `apr distill` | LoRA/QLoRA fine-tuning, knowledge distillation |
| Model Ops | `apr convert`, `apr quantize`, `apr merge`, `apr export` | Format conversion, quantization, model merging |
| Inspection | `apr inspect`, `apr validate`, `apr tensors`, `apr diff` | Model debugging, validation, comparison |
| Profiling | `apr profile`, `apr bench`, `apr qa` | Roofline analysis, benchmarks, QA gates |
| Registry | `apr pull`, `apr list`, `apr rm`, `apr publish` | HuggingFace Hub integration, model cache |
| GPU | `apr gpu`, `apr parity`, `apr ptx` | GPU status, CPU/GPU parity checks, PTX analysis |
| Monitoring | `apr tui`, `apr monitor`, `apr cbtop` | Terminal UI, training monitor, ComputeBrick pipeline |

## Numbers

- 70 workspace crates (consolidated from 20 separate repos)
- 25,391 tests, all passing
- 405 provable contracts (equation-based verification)
- 57 CLI commands with contract coverage
- 0 `[patch.crates-io]` entries — clean workspace deps

## Install

```sh
# Install the `apr` binary
cargo install aprender

# Verify
apr --version
```

## CLI Examples

```sh
# Run inference
apr run hf://Qwen/Qwen2.5-Coder-1.5B-Instruct-GGUF "Explain quicksort"
apr chat hf://meta-llama/Llama-3-8B-Instruct-GGUF

# Serve model as API
apr serve model.gguf --port 8080

# Inspect model
apr inspect model.gguf
apr validate model.apr --quality --strict
apr tensors model.gguf | head -20

# Fine-tune with LoRA
apr finetune model.gguf --adapter lora --rank 64 --data train.jsonl

# Convert formats
apr convert model.safetensors --quantize q4_k -o model.gguf
apr export model.apr --format gguf -o model.gguf

# Profile
apr profile model.gguf --roofline
apr bench model.gguf --assert-tps 100
```

## Library Usage

The ML library is available as `aprender` on crates.io:

```toml
[dependencies]
aprender-core = "0.29"
```

```rust
use aprender::linear_regression::LinearRegression;
use aprender::traits::Estimator;

let mut model = LinearRegression::new();
model.fit(&x_train, &y_train)?;
let predictions = model.predict(&x_test)?;
```
Algorithms: Linear/Logistic Regression, Decision Trees, Random Forest, GBM, Naive Bayes, KNN, SVM, K-Means, PCA, ARIMA, ICA, GLMs, Graph algorithms, Bayesian inference, text processing, audio processing.

## Architecture

Monorepo with 70 crates in a flat `crates/aprender-*` layout (the same pattern as Polars, Burn, and Nushell):

```text
paiml/aprender/
├── Cargo.toml                    # Workspace root + `cargo install aprender`
├── crates/
│   ├── aprender-core/            # ML library (use aprender::*)
│   ├── apr-cli/                  # CLI logic (57 commands)
│   ├── aprender-compute/         # SIMD/GPU compute (was: trueno)
│   ├── aprender-gpu/             # CUDA PTX kernels (was: trueno-gpu)
│   ├── aprender-serve/           # Inference server (was: realizar)
│   ├── aprender-train/           # Training loops (was: entrenar)
│   ├── aprender-orchestrate/     # Agents, RAG (was: batuta)
│   ├── aprender-contracts/       # Provable contracts (was: provable-contracts)
│   ├── aprender-profile/         # Profiling (was: renacer)
│   ├── aprender-present-*/       # TUI framework (was: presentar)
│   ├── aprender-db/              # Embedded analytics DB
│   ├── aprender-graph/           # Graph database
│   ├── aprender-rag/             # RAG pipeline
│   └── ... (70 crates total)
├── contracts/                    # 405 provable YAML contracts
└── book/                         # mdBook documentation
```

## Performance

| Model | Format | Speed | Hardware |
|---|---|---|---|
| Qwen2.5-Coder 1.5B | Q4_K | 40+ tok/s | CPU (AVX2) |
| Qwen2.5-Coder 7B | Q4_K | 225+ tok/s | RTX 4090 |
| TinyLlama 1.1B | Q4_0 | 17 tok/s | CPU (APR format) |

## Framework Comparison

Benchmarked against real inference engines on Qwen2.5-Coder 7B Q4_K (RTX 4090). Data from the candle-vs-apr proof of concept:

### Inference Speed (single request, decode tok/s)

| Engine | tok/s | vs Candle | Architecture |
|---|---|---|---|
| llama.cpp b7746 | 443.6 | 1.95x | C++, Flash Attention |
| aprender (realizar) | 369.9 | 1.63x | Rust, CUDA graph + Flash Decoding |
| Candle | 227.4 | 1.00x | Rust, per-op dispatch |

### Batched Throughput (aprender only — Candle has no server)

| Concurrency | Agg tok/s | Scaling | Method |
|---|---|---|---|
| 1 | 367 | 1.0x | Single request |
| 8 | 954 | 2.6x | Continuous batching |
| 32 | 3,220 | 8.8x | Orca-style iteration scheduling |
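The continuous-batching idea behind these numbers can be sketched as a toy scheduler: each iteration decodes one token for every in-flight request and admits waiting requests the moment a slot frees up. This is an illustrative model with hypothetical types, not aprender's server code:

```rust
use std::collections::VecDeque;

// A request that still needs `remaining` decode steps to finish.
struct Request {
    id: usize,
    remaining: usize,
}

// Iteration-level scheduling: instead of waiting for a whole batch to
// finish, retired requests free their slot for the next iteration.
fn run(mut queue: VecDeque<Request>, max_batch: usize) -> Vec<usize> {
    let mut active: Vec<Request> = Vec::new();
    let mut finished = Vec::new();
    while !queue.is_empty() || !active.is_empty() {
        // Admit waiting requests into free batch slots.
        while active.len() < max_batch {
            match queue.pop_front() {
                Some(r) => active.push(r),
                None => break,
            }
        }
        // One decode iteration: every active request advances one token.
        for r in active.iter_mut() {
            r.remaining -= 1;
        }
        // Retire finished requests immediately; slots refill next loop.
        active.retain(|r| {
            if r.remaining == 0 {
                finished.push(r.id);
                false
            } else {
                true
            }
        });
    }
    finished
}

fn main() {
    let queue = VecDeque::from(vec![
        Request { id: 0, remaining: 1 },
        Request { id: 1, remaining: 3 },
        Request { id: 2, remaining: 2 },
    ]);
    // With max_batch = 2, request 2 is admitted as soon as 0 finishes.
    assert_eq!(run(queue, 2), vec![0, 1, 2]);
}
```

Because slots never sit idle between requests, aggregate throughput scales with concurrency until the GPU saturates, which is what the 8.8x figure at c=32 reflects.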

### Why Aprender Beats Candle

| aprender advantage | Candle limitation | Impact |
|---|---|---|
| CUDA graph (647 kernels, 1 launch) | Per-op dispatch (~640 launches) | +26% |
| Flash Decoding (chunked KV) | Standard SDPA | +15% long ctx |
| Fused DP4A GEMV (4-bit native) | Separate dequant + matmul | ~10% |
| Continuous batching server | CLI only, no server | 8.8x at c=32 |

### ML Training: apr vs Ludwig

From ground-truth-apr-ludwig (21 recipes, Popperian falsification methodology):

| Capability | aprender | Ludwig | Notes |
|---|---|---|---|
| Classification (Iris, Wine) | `apr finetune` | `ludwig train` | Both achieve >95% accuracy |
| LoRA fine-tuning | `apr finetune --lora` | Not native | apr: rank-64 in minutes |
| Quantization (INT8/INT4) | `apr quantize` | Not supported | apr-native capability |
| Model merging | `apr merge --strategy ties` | Not supported | TIES/DARE/SLERP |
| Provable contracts | 405 YAML contracts | None | Equation-based verification |
| Single binary | `cargo install aprender` | `pip install ludwig` | Rust vs Python |
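Of the merge strategies listed, SLERP is the simplest to illustrate: interpolate between two flattened weight vectors along the great circle rather than linearly, preserving magnitude. A minimal sketch with hypothetical helpers, not `apr merge`'s actual code:

```rust
fn dot(a: &[f32], b: &[f32]) -> f32 {
    a.iter().zip(b).map(|(x, y)| x * y).sum()
}

fn norm(a: &[f32]) -> f32 {
    dot(a, a).sqrt()
}

// Spherical linear interpolation between weight vectors a and b at t in [0, 1].
fn slerp(a: &[f32], b: &[f32], t: f32) -> Vec<f32> {
    let cos_theta = (dot(a, b) / (norm(a) * norm(b))).clamp(-1.0, 1.0);
    let theta = cos_theta.acos();
    if theta.abs() < 1e-6 {
        // Nearly parallel vectors: fall back to linear interpolation.
        return a.iter().zip(b).map(|(x, y)| x * (1.0 - t) + y * t).collect();
    }
    let wa = ((1.0 - t) * theta).sin() / theta.sin();
    let wb = (t * theta).sin() / theta.sin();
    a.iter().zip(b).map(|(x, y)| x * wa + y * wb).collect()
}

fn main() {
    // Orthogonal unit vectors: the SLERP midpoint stays on the unit sphere,
    // whereas the linear midpoint would shrink to length ~0.707.
    let a = [1.0f32, 0.0];
    let b = [0.0f32, 1.0];
    let mid = slerp(&a, &b, 0.5);
    assert!((norm(&mid) - 1.0).abs() < 1e-5);
}
```

TIES and DARE are more involved (they resolve sign conflicts and sparsify deltas before merging), but all three reduce to per-tensor vector arithmetic of this kind.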

All benchmarks are reproducible from the linked repos with `cargo test`.

## Provable Contracts

Every CLI command and kernel has a provable contract (`contracts/*.yaml`) with equations, preconditions, postconditions, and falsification tests:

```yaml
equations:
  validate_exit_code:
    formula: exit_code = if score < 50 then 5 else 0
    invariants:
      - score < 50 implies exit_code != 0
falsification_tests:
  - id: FALSIFY-CLI-001
    prediction: apr validate bad-model.apr exits non-zero
```

405 contracts span inference, training, quantization, attention, FFN, tokenization, model formats, and CLI safety.
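The contract's equation translates directly into executable checks. A sketch of how a falsification test might assert the invariant (illustrative only, not the actual test harness):

```rust
// The exit-code equation from the contract, as plain Rust:
// exit_code = if score < 50 then 5 else 0.
fn validate_exit_code(score: u32) -> i32 {
    if score < 50 { 5 } else { 0 }
}

fn main() {
    // Invariant: score < 50 implies exit_code != 0.
    for score in 0..50 {
        assert_ne!(validate_exit_code(score), 0);
    }
    // A passing model (score >= 50) exits 0.
    assert_eq!(validate_exit_code(50), 0);
    assert_eq!(validate_exit_code(85), 0);
}
```

Because the contract states an equation rather than a prose description, a test that finds any score violating it falsifies the contract outright, which is the Popperian methodology mentioned above.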

## Migration from Old Crates

All old crate names still work via backward-compatible shim crates:

| Old | New | Status |
|---|---|---|
| `trueno = "0.18"` | `aprender-compute = "0.29"` | Shim available |
| `entrenar = "0.7"` | `aprender-train = "0.29"` | Shim available |
| `realizar = "0.8"` | `aprender-serve = "0.29"` | Shim available |
| `batuta = "0.7"` | `aprender-orchestrate = "0.29"` | Shim available |
Old repositories are archived and read-only. All development happens here.

## Contributing

```sh
git clone https://github.com/paiml/aprender
cd aprender
cargo test --workspace --lib    # 25,391 tests
cargo check --workspace         # 70 crates
apr --help                      # 57 commands
```

## License

MIT