Our Products

A complete stack for local AI inference — from the runtime to the agent layer to security.

xCore

The native inference runtime for NVIDIA Blackwell + Ampere GPUs. 50+ hand-tuned CuTile kernels compiled directly to GPU machine code. BF16, FP8, and NVFP4 with automatic detection. OpenAI-compatible API, continuous batching, and streaming — one binary for any Blackwell GPU.

CuTile Kernels NVFP4 / FP8 / BF16 OpenAI-compatible API Continuous Batching

xAgent

An autonomous AI agent framework built on top of xCore. Deploy intelligent agents that reason, plan, and execute tasks locally with full hardware acceleration — no cloud dependency required.

Local Agents Tool Use Multi-step Reasoning Offline Capable

xGuard

Enterprise-grade security and safety layer for local AI deployments. Content filtering, prompt injection detection, and compliance monitoring — all running on-device with zero data leaving your network.

On-device Security Prompt Injection Detection Content Filtering Compliance

Ready for the Next Generation?

Join the early access program and experience the speed of Xanuedge in your own environment.