What is LFM2.5-1.2B-Thinking?

LFM2.5-1.2B-Thinking is a 1.2-billion-parameter open-weight reasoning model developed by Liquid AI. It is designed to run entirely on device: it fits within 900MB of memory and generates thinking traces to work through complex problems systematically, with no cloud connectivity required.

How much memory does LFM2.5-1.2B-Thinking require?

The model fits within 900MB of memory on smartphones. Typical deployments use 720MB to 853MB depending on quantization and hardware platform, making it suitable for edge devices with limited resources.
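As a rough illustration of how quantization drives these figures, the sketch below estimates the size of the weights alone from the parameter count and an assumed number of bits per weight. The function name and the bit widths shown are illustrative assumptions, not published Liquid AI quantization settings, and the estimate ignores the KV cache, activations, and runtime overhead.

```python
def weight_memory_mb(num_params: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in megabytes (1 MB = 10**6 bytes).

    Assumption: every parameter is stored at the same bit width. Real
    quantization schemes often mix precisions, so treat this as a lower-bound
    style estimate rather than a measured footprint.
    """
    return num_params * bits_per_weight / 8 / 1e6


NUM_PARAMS = 1.2e9  # parameter count stated in this listing

# Hypothetical bit widths, chosen only to show the trend.
for bits in (16, 8, 4):
    size = weight_memory_mb(NUM_PARAMS, bits)
    print(f"{bits:>2} bits/weight: ~{size:.0f} MB")
```

Under these assumptions, 4-bit weights come to about 600MB. The 720MB to 853MB range quoted above is higher, which would be consistent with mixed-precision weights plus runtime overhead, although the listing does not break the numbers down.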

What hardware platforms are supported?

The model supports Qualcomm Snapdragon NPUs, AMD Ryzen AI processors, Apple Silicon, and NVIDIA GPUs. It offers native integration with llama.cpp, MLX, vLLM, and ONNX Runtime for optimized inference across these platforms.

Is LFM2.5-1.2B-Thinking free to use?

Yes. The model is released with open weights, so it is free to download, fine-tune, and deploy, subject to the terms of the license published with the model. Enterprise customers can contact Liquid AI sales for custom deployment solutions and support.

What is curriculum RL training?

Curriculum RL is Liquid AI's training methodology that uses parallel, domain-specific tracks for instruction following, reasoning, math, and tool use. Rather than training a single model on all domains at once, the training process produces a specialized checkpoint for each track and merges those checkpoints iteratively, which preserves each capability without interference from the others.
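The merging step can be pictured with a minimal sketch. The listing does not say which merge method Liquid AI uses, so the example below assumes the simplest option, a weighted average of matching parameters. The `merge_checkpoints` helper, the toy checkpoints, and the parameter name `layer.weight` are all hypothetical.

```python
from typing import Dict, List, Sequence

# Toy checkpoint format (assumption): parameter name -> flat list of weights.
Checkpoint = Dict[str, List[float]]


def merge_checkpoints(
    checkpoints: Sequence[Checkpoint], weights: Sequence[float]
) -> Checkpoint:
    """Merge checkpoints by a weighted average of each parameter.

    This is plain linear averaging, used here only as a stand-in for
    whatever merge procedure the actual training pipeline applies.
    """
    if not checkpoints or len(checkpoints) != len(weights):
        raise ValueError("need one merge weight per checkpoint")
    total = sum(weights)
    if total <= 0:
        raise ValueError("merge weights must sum to a positive value")

    merged: Checkpoint = {}
    for name, reference in checkpoints[0].items():
        merged[name] = [
            sum(w * ckpt[name][i] for ckpt, w in zip(checkpoints, weights)) / total
            for i in range(len(reference))
        ]
    return merged


# Two hypothetical domain-specific checkpoints with a single parameter each.
math_ckpt = {"layer.weight": [1.0, 2.0]}
tool_ckpt = {"layer.weight": [3.0, 6.0]}

merged = merge_checkpoints([math_ckpt, tool_ckpt], [1.0, 1.0])
print(merged)  # {'layer.weight': [2.0, 4.0]}
```

In an iterative setup, the merged checkpoint would become the starting point for the next round of domain-specific training, and the merge would be repeated.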

LFM2.5-1.2B-Thinking

LFM2.5-1.2B-Thinking is a 1.2-billion-parameter open-weight reasoning model by Liquid AI that runs entirely on device in under 1GB of memory. It delivers advanced mathematics, tool use, and instruction-following capabilities for edge AI deployment.

About LFM2.5-1.2B-Thinking

LFM2.5-1.2B-Thinking is a compact reasoning model designed to bring data-center-level AI capabilities to mobile and edge devices. With just 1.2 billion parameters, it fits within 900MB of memory on smartphones while generating explicit thinking traces to solve complex problems systematically. The model is a significant advance in on-device AI, enabling sophisticated reasoning without internet connectivity or cloud dependencies.

Developed by Liquid AI, the model is trained with a curriculum reinforcement learning approach that develops domain-specific capabilities in separate parallel tracks rather than in a single joint training run. This methodology prevents capability interference while optimizing for mathematics, tool use, and advanced reasoning. The model outperforms larger alternatives while using 40% fewer parameters, and it delivers faster inference and more concise outputs.

Ecosystem support is extensive, with day-zero compatibility across Qualcomm Snapdragon NPUs, AMD Ryzen AI processors, Apple Silicon, and NVIDIA GPUs. Native integration with popular frameworks such as llama.cpp, MLX, vLLM, and Ollama lets developers deploy immediately across vehicles, smartphones, laptops, IoT devices, and embedded systems.

Key Features

On Device Reasoning: Generates explicit thinking traces before producing answers, enabling systematic problem solving for mathematics and logic entirely offline.
Ultra Low Memory: Fits within 900MB on phones, with typical deployments using 720MB to 853MB, making advanced AI accessible for resource-constrained environments.
Curriculum RL Training: Uses parallel domain-specific tracks with iterative model merging to optimize reasoning, math, and tool use capabilities without interference.
Doom Loop Prevention: Reduces the rate of repetitive generation loops from 15.74% to 0.36% through preference alignment and n-gram repetition penalties.
Long Context Support: Maintains robust performance up to 32K context length with sustained decoding throughput of 52 tok/s at 16K context on NPUs.
Broad Hardware Ecosystem: Native support for Qualcomm Hexagon, AMD XDNA, and Apple Neural Engine via partnerships with Nexa AI and FastFlowLM.
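
To make the Doom Loop Prevention entry above more concrete, here is a minimal sketch of how a repetitive generation could be flagged by counting repeated n-grams. The listing does not describe how Liquid AI measures its doom-loop rate, so the metric, the function names, and the 0.5 threshold below are illustrative assumptions only.

```python
from collections import Counter
from typing import Sequence


def repeated_ngram_fraction(tokens: Sequence[str], n: int = 3) -> float:
    """Fraction of n-grams in `tokens` that duplicate an earlier n-gram."""
    if n <= 0:
        raise ValueError("n must be positive")
    ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    if not ngrams:
        return 0.0
    counts = Counter(ngrams)
    # Every occurrence beyond the first of a given n-gram counts as a repeat.
    repeats = sum(count - 1 for count in counts.values())
    return repeats / len(ngrams)


def looks_like_doom_loop(
    tokens: Sequence[str], n: int = 3, threshold: float = 0.5
) -> bool:
    """Flag an output whose repeated n-gram fraction reaches the threshold.

    The threshold is a hypothetical value chosen for this example.
    """
    return repeated_ngram_fraction(tokens, n) >= threshold


healthy = "the answer is forty two because six times seven".split()
looping = ("wait let me check " * 6).split()

print(looks_like_doom_loop(healthy))  # False
print(looks_like_doom_loop(looping))  # True
```

A check of this kind could be run over a batch of sampled outputs to estimate what share of generations get stuck, which is the sort of percentage the feature description reports.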

Pricing

Open Weight: $0. Free to download, fine-tune, and deploy from Hugging Face and Liquid AI repositories, subject to the model's license terms.
Enterprise: Contact sales. Custom pricing and solutions available for enterprise deployments requiring additional support or specialized implementations.

Pricing last updated: February 26, 2026 at 11:20 AM