"Run AI on-device and cross-platform with their lightweight inference framework."
TLDR: Deploy AI models locally, privately, and offline in any app using Cactus. Cactus is a blazing-fast inference engine optimized for smartphones and comes with React Native, Flutter, and Kotlin bindings.
Cactus is a cross-platform, open-source framework for running inference on smartphones, wearables, and other low-power devices. It directly supports any LLM or VLM available on Hugging Face.
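To give a feel for what this looks like in an app, here is a minimal sketch using the React Native binding. The package name `cactus-react-native` comes from the project, but the class, method, and option names below (`CactusLM.init`, `completion`, `release`, `n_ctx`, `n_predict`) are assumptions modeled on common llama.cpp-style bindings, so check the project docs for the actual API:

```typescript
// Hypothetical sketch: load a local GGUF model and generate text on-device.
// API names are assumptions, not the verified cactus-react-native interface.
import { CactusLM } from 'cactus-react-native';

async function summarizeOnDevice(text: string): Promise<string> {
  // Initialize from a locally downloaded GGUF file (e.g. a 4-bit Qwen build).
  const lm = await CactusLM.init({
    model: '/data/models/qwen2.5-0.5b-instruct-q4_0.gguf', // assumed local path
    n_ctx: 2048, // context window; bigger costs more memory on a phone
  });

  // Chat-style completion runs entirely on-device -- no network call involved.
  const result = await lm.completion(
    [
      { role: 'system', content: 'You are a concise assistant.' },
      { role: 'user', content: `Summarize: ${text}` },
    ],
    { n_predict: 128, temperature: 0.7 }, // assumed option names
  );

  await lm.release(); // free native memory when done (assumed cleanup method)
  return result.text;
}
```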
The recently released Google AI Edge and Apple Foundation Models frameworks, by contrast, are platform-specific and primarily support each company's own models.
To this end, Cactus:
- Is available in Flutter and React Native for cross-platform developers, since most apps are built with these today.
- Supports any GGUF model you can find on Hugging Face: Qwen, Gemma, Llama, DeepSeek, Phi, Mistral, SmolLM, SmolVLM, InternVLM, Jan Nano, and more.
- Accommodates models from FP32 down to 2-bit quantization, for better efficiency and less device strain.
- Supports MCP tool calls so models can act on the device (set reminders, search the gallery, reply to messages) and stay genuinely useful.
- Falls back to large cloud models for complex, constrained, or large-context tasks, ensuring robustness and high availability (see the routing sketch after this list).
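To make the last two points concrete, here is a hedged sketch of the hybrid pattern: run on-device when the request fits the local context budget, and hand off to a hosted model when it does not. Both helpers and the token heuristic are illustrative assumptions, not confirmed Cactus APIs:

```typescript
// Illustrative hybrid routing: on-device first, cloud fallback for long or
// complex jobs. Every name below is a stand-in, not a verified Cactus API.

type ChatMessage = { role: 'system' | 'user' | 'assistant'; content: string };

const LOCAL_CTX_TOKENS = 2048; // assumed on-device context budget

// Stand-in for the local Cactus completion call (hypothetical signature).
async function runOnDevice(messages: ChatMessage[]): Promise<string> {
  // ... invoke the on-device model here ...
  return 'local answer';
}

// Stand-in for a hosted model behind an HTTP API (hypothetical endpoint).
async function runInCloud(messages: ChatMessage[]): Promise<string> {
  const res = await fetch('https://api.example.com/v1/chat', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ messages }),
  });
  const body = (await res.json()) as { text: string };
  return body.text;
}

// Route by a rough token estimate: local when it fits, cloud when it doesn't.
async function route(messages: ChatMessage[]): Promise<string> {
  const chars = messages.reduce((n, m) => n + m.content.length, 0);
  const approxTokens = Math.ceil(chars / 4); // ~4 chars per token heuristic
  return approxTokens > LOCAL_CTX_TOKENS
    ? runInCloud(messages)
    : runOnDevice(messages);
}
```

The design point is that the fallback is a routing decision in app code, so the private on-device path stays the default and the cloud is only touched when the task genuinely needs it.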
So far, their customers have built:
- Personalized, private RAG and prompt-enhancement pipelines for their app users (a sketch of the pattern follows this list).
- Offline fallbacks for the large remote AI models.
- On-phone tool-use agents for tasks like gallery and calendar management.
- AI for medical and other privacy-sensitive industries.
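One way a private, on-device RAG pipeline like those above can be structured is sketched below. The `embed` function stands in for whatever local embedding call the binding exposes; it and the rest are generic illustrations, not a specific Cactus API:

```typescript
// Minimal on-device RAG sketch: embed documents locally, retrieve the closest
// chunks by cosine similarity, and prepend them to the prompt. Nothing here
// leaves the device, which is where the privacy benefit comes from.

// Hypothetical local embedding call -- replace with the binding's real one.
async function embed(text: string): Promise<number[]> {
  // ... run a local embedding model here ...
  return [];
}

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb) || 1);
}

type Chunk = { text: string; vec: number[] };

// Return the k chunks most similar to the query.
async function retrieve(query: string, chunks: Chunk[], k = 3): Promise<string[]> {
  const qv = await embed(query);
  return chunks
    .map((c) => ({ text: c.text, score: cosine(qv, c.vec) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((c) => c.text);
}

// Build the final prompt with retrieved context ahead of the question.
async function buildPrompt(query: string, chunks: Chunk[]): Promise<string> {
  const context = (await retrieve(query, chunks)).join('\n---\n');
  return `Context:\n${context}\n\nQuestion: ${query}`;
}
```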