Kalmantic Labs

Inference Lab

Inference Lab

Open-source tools for production AI systems. Static analysis, weight optimization, and inference tooling.

PeakInfer

Static Analysis for AI Applications

Find configuration drift, cost issues, and performance problems before they reach production. PeakInfer analyzes your AI application code and infrastructure configuration to surface issues that would otherwise only appear at scale.

Available NowOpen Source

Configuration Drift Detection

Identify when your AI configuration has drifted from baseline, before it impacts production performance or costs.

Cost Analysis

Surface cost issues in your AI pipeline — from token usage patterns to inefficient model routing.

Performance Profiling

Find latency bottlenecks, memory issues, and suboptimal inference patterns through static analysis.

Available On

Platforms

CLI

Run analysis from your terminal. Integrates into any CI/CD pipeline.

VS Code Extension

Real-time analysis as you write AI application code.

GitHub Action

Automated checks on every pull request. Catch issues before merge.

Claude MCP

Use as a Model Context Protocol server with Claude for AI-assisted analysis.

PeakWeights

Weight Optimization Library

Research-backed weight optimization techniques for efficient model deployment. PeakWeights bridges the gap between research and production by providing practical tools for model compression, quantization, and inference optimization.

Available on GitHub
View on GitHub

Open Source

Everything we build is public

Benchmarks, tools, and research. All on GitHub. Contributions welcome.