Installation
Deployment profiles
| Profile | Install | Engines | Latency |
|---|---|---|---|
| Lite | pip install inferwall | Heuristic (Rust) | <0.3ms p99 |
| Standard | pip install inferwall[standard] | + Classifier (ONNX) + Semantic (FAISS) | <80ms p99 |
| Full | pip install inferwall[full] | + LLM-Judge | <2s p99 |
Post-install setup
Generate API keys
iwk_scan_…) and an admin key (iwk_admin_…) and writes them to .env.local.Set environment variables
Export the generated keys before starting the server:Or source the generated file directly:
Install ML models (Standard and Full only)
If you installed the Models are cached in
standard or full profile, download the ML models:~/.cache/inferwall/models/ and downloaded from HuggingFace (~730 MB for Standard).In development, you can skip API key setup entirely. Run
inferwall serve without setting IW_API_KEY or IW_ADMIN_KEY and scan without any Authorization header. Dev mode is not suitable for production.Environment variables
| Variable | Description | Default |
|---|---|---|
IW_API_KEY | Scan API key | None (dev mode) |
IW_ADMIN_KEY | Admin API key | None (dev mode) |
IW_HOST | Server bind host | 0.0.0.0 |
IW_PORT | Server port | 8000 |
IW_TLS | TLS mode: auto, off, or acme | off |
IW_PROFILE | Deployment profile: lite, standard, full | lite |
IW_LOG_LEVEL | Log verbosity: debug, info, warning, error | info |
IW_REDIS_URL | Redis URL for distributed sessions | None |
TLS modes
| Mode | Behavior |
|---|---|
off | Plain HTTP (default) |
auto | TLS using a certificate at the path provided in IW_TLS |
acme | Automatic certificate provisioning via ACME/Let’s Encrypt |
Redis for distributed sessions
SetIW_REDIS_URL to enable distributed rate limiting and session state across multiple InferenceWall instances:
Health check endpoints
| Endpoint | Purpose | Use in |
|---|---|---|
GET /v1/health/live | Liveness — is the process alive? | Kubernetes livenessProbe |
GET /v1/health/ready | Readiness — can it handle requests? | Kubernetes readinessProbe |
GET /v1/health | Full health with signature count and engine status | Monitoring dashboards |
Further reading
Environment variables reference
Complete list of all environment variables with types, defaults, and valid values.
Health API
Response schemas for the liveness, readiness, and full health endpoints.