Add ONNX inference documentation#49

Merged: alay2shah merged 14 commits into main from onnx on Feb 5, 2026
@alay2shah alay2shah commented Jan 31, 2026

  • ONNX code examples for Python + WebGPU
  • LiquidONNX tool docs
  • Link model cards to table
  • Deprecate some pages for LLM.txt indexing
  • Vision llama.cpp examples in model cards (lagging commit)


## Pre-exported Models

Pre-exported ONNX models are available from LiquidAI and the [onnx-community](https://huggingface.co/onnx-community). Check the [Model Library](/docs/models/complete-library) for a complete list of available formats.

I'd add a link on "from LiquidAI" that points to our Hugging Face page.

Comment on lines 24 to 32
### Installation

```bash
pip install onnxruntime transformers numpy huggingface_hub jinja2

# For GPU support
pip install onnxruntime-gpu transformers numpy huggingface_hub jinja2
```
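
As a rough illustration of how the two install variants above differ at runtime, the sketch below picks an execution provider list depending on what the installed `onnxruntime` build reports. The helper names `pick_providers` and `detect_available` and the GPU-first preference order are assumptions made for this example, not part of the documentation under review; `onnxruntime.get_available_providers()` is the only library call used.

```python
from typing import List, Sequence

# Preference order is an assumption for illustration: GPU first, CPU as fallback.
PREFERRED = ("CUDAExecutionProvider", "CPUExecutionProvider")


def pick_providers(available: Sequence[str],
                   preferred: Sequence[str] = PREFERRED) -> List[str]:
    """Return the preferred providers that are available, in preference order."""
    chosen = [p for p in preferred if p in available]
    # If nothing matched, default to the CPU provider so the list is never empty.
    return chosen or ["CPUExecutionProvider"]


def detect_available() -> List[str]:
    """Ask onnxruntime which providers this install supports.

    Returns an empty list when onnxruntime is not installed.
    """
    try:
        import onnxruntime as ort
    except ImportError:
        return []
    return list(ort.get_available_providers())


if __name__ == "__main__":
    print(pick_providers(detect_available()))
```

The resulting list could then be passed as the `providers` argument when creating an `onnxruntime.InferenceSession`.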

Maybe instead point to the onnx-export repo directly, or use it:

git clone ...
uv sync
uv run ...

alay2shah and others added 14 commits February 5, 2026 12:47
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Updated complete-library.mdx table with ONNX links for 5 models
- Added ONNX buttons to LFM2-8B-A1B, LFM2-VL-3B, LFM2-VL-1.6B,
  LFM2-VL-450M, and LFM2.5-Audio-1.5B model pages

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Delete LEAP pages that are not in navigation and contain outdated content:
- find-model.mdx
- vibe-check-models.mdx
- index.mdx
- finetuning.mdx
- laptop-support.mdx

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Delete docs pages that are not in navigation:
- docs/index.mdx (redirect page)
- docs/key-concepts/models.mdx (deprecated models page)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Change /leap/index references to /leap/edge-sdk/overview
since the old LEAP index page was removed.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Make all model names in the Model Chart link to their respective
model pages for easier navigation. Also adds LFM2-2.6B-Exp to table.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Add llama.cpp tab with installation and usage instructions
for VL models: LFM2-VL-3B, LFM2-VL-1.6B, LFM2-VL-450M, LFM2.5-VL-1.6B

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Style first-column model name links differently from checkmark links:
- No underline, regular text appearance
- Purple text + light background on hover
- Distinct from green checkmark links in other columns

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Co-authored-by: Yuri Khrustalev <ykhrustalev@users.noreply.github.com>
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Move LiquidONNX to the top as the main export/inference method.
Simplify by removing verbose export options table (users can
check the repo for advanced options).

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Reorder sections to show WebGPU (browser) inference before Python.
Collapse the lengthy Python KV cache example into an accordion.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Link to LFM2 chat and LFM2.5 Audio browser demos as examples
of deployed ONNX/WebGPU inference.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
@alay2shah alay2shah merged commit 1878394 into main Feb 5, 2026
6 checks passed
@alay2shah alay2shah deleted the onnx branch February 5, 2026 17:54

3 participants