Urja Labs — Writing

Notes on on-device AI, Kotlin Multiplatform, and shipping local LLMs.
https://urjalabs.in/
Language: en-us

What's new in NativeLM v0.9.0: charts in chat, an adaptive UI, and a real engine library
https://urjalabs.in/blog/nativelm-v0-9-0/
v0.9 teaches the on-device model to answer with charts, makes the UI adapt from phone to tablet, and pulls the whole AI core out of the app into a reusable Kotlin Multiplatform library — still fully local, no account, no upload, no telemetry.
Published: Fri, 05 Jun 2026 00:00:00 GMT
Tags: on-device-llm, android, release, charts, kotlin-multiplatform

Your data, your key: local encrypted backup without a server
https://urjalabs.in/blog/nativelm-local-encrypted-backup/
NativeLM keeps everything on your phone — which means losing the phone means losing the data. v0.7 fixes that with a passphrase-encrypted .nlmbak file you fully control: Argon2id → AES-256-GCM, no server, no account, no key we hold.
Published: Thu, 04 Jun 2026 00:00:00 GMT
Tags: on-device-llm, android, privacy, cryptography, backup

Talk to your local LLM: on-device voice input with Whisper
https://urjalabs.in/blog/nativelm-on-device-voice-input/
NativeLM v0.8 lets you dictate your questions — transcribed entirely on-device with Whisper (whisper.cpp), no cloud. Here's why we picked Whisper over Android's built-in recognizer, and how the Whisper model became a first-class 'Audio' entry in the model catalog.
Published: Thu, 04 Jun 2026 00:00:00 GMT
Tags: on-device-llm, android, privacy, speech-to-text, whisper

The OCR library that phoned home: restoring NativeLM's zero-telemetry guarantee
https://urjalabs.in/blog/nativelm-zero-telemetry-mlkit/
Google's ML Kit gave NativeLM on-device OCR — and quietly bundled a datatransport pipeline that uploaded diagnostics to firebaselogging.googleapis.com on startup. Here's how we found it and stripped it out with a three-line manifest merge.
Published: Thu, 04 Jun 2026 00:00:00 GMT
Tags: on-device-llm, android, privacy, telemetry, ml-kit

AirDrop for your LLM: building cloudless peer-to-peer sync without Google Play Services
https://urjalabs.in/blog/nativelm-p2p-sync/
How we built local device-to-device sync for NativeLM using mDNS and TCP sockets, keeping your private AI data completely off the cloud — and why we explicitly avoided Google's Nearby Connections API.
Published: Wed, 03 Jun 2026 00:00:00 GMT
Tags: on-device-llm, android, privacy, sync, architecture

Ask in your language, about your English documents: on-device cross-lingual RAG
https://urjalabs.in/blog/nativelm-multilingual-rag/
NativeLM v0.8 answers in Hindi, Tamil, Kannada and more — reading your English documents and replying in your language, with zero translation model. The whole feature is one prompt directive (plus one stubborn script bug).
Published: Wed, 03 Jun 2026 00:00:00 GMT
Tags: on-device-llm, android, multilingual, rag, india

Turning your documents into artifacts, on-device: NativeLM Studio
https://urjalabs.in/blog/on-device-studio-nativelm/
NativeLM v0.6.0 adds Studio — generate briefings, FAQs, study guides, timelines, mind maps, and even spoken audio overviews from your own documents, entirely on the phone, via a map-reduce pipeline over on-device Gemma.
Published: Wed, 03 Jun 2026 00:00:00 GMT
Tags: on-device-llm, android, gemma, map-reduce, text-to-speech, privacy

What's new in NativeLM v0.5.0: open, highlight, zoom, OCR, better retrieval
https://urjalabs.in/blog/nativelm-v0-5-0/
v0.4 made on-device document chat work. v0.5 makes it usable — tap a citation to open the source at the exact page with the passage highlighted, pinch to zoom, chat with scans, and get sharper answers. Plus the bugs we fixed along the way.
Published: Tue, 02 Jun 2026 00:00:00 GMT
Tags: on-device-llm, android, rag, ocr, release

Chatting with scanned documents: on-device OCR (no cloud)
https://urjalabs.in/blog/on-device-ocr-nativelm/
NativeLM v0.5.0 reads scanned PDFs and photos with on-device OCR, and blends keyword + vector search so exact terms actually get retrieved — all without an image ever leaving the phone.
Published: Tue, 02 Jun 2026 00:00:00 GMT
Tags: on-device-llm, android, ocr, rag, vector-search, privacy

The low-end gauntlet: running a local LLM on budget Android phones
https://urjalabs.in/blog/nativelm-low-end-devices/
A local LLM that only runs on flagships isn't private AI for everyone — it's a toy for people with expensive phones. Here's how NativeLM tiers models across devices, why budget phones break in two different ways (RAM and the navigation bar), and what's still hard about the 4–6 GB tier.
Published: Mon, 01 Jun 2026 00:00:00 GMT
Tags: on-device-llm, android, performance, memory, ux

Why Android's ActivityManager lies about RAM — and how litertlm-kmp works around it
https://urjalabs.in/blog/android-oem-ram-lies/
Xiaomi, Realme, and OPPO inflate reported RAM with swap-to-flash. Here's how we detect it and prevent OOM crashes when loading on-device LLMs.
Published: Mon, 01 Jun 2026 00:00:00 GMT
Tags: on-device-llm, android, kotlin-multiplatform, gemma, oem

Shipping on-device RAG: Building NativeLM for Android
https://urjalabs.in/blog/on-device-rag-nativelm/
How we implemented fully offline document RAG using MediaPipe's USE-Lite and ObjectBox HNSW vector search to ground Gemma's chat answers in imported PDFs.
Published: Mon, 01 Jun 2026 00:00:00 GMT
Tags: on-device-llm, android, rag, vector-search, gemma

Stateful KV-cache sessions for on-device Gemma on Android
https://urjalabs.in/blog/litertlm-kmp-v0-3-kv-cache-sessions/
How litertlm-kmp v0.3 makes multi-turn memory lossless and free — plus what an on-device CPU/GPU/NPU benchmark actually told me.
Published: Sat, 30 May 2026 00:00:00 GMT
Tags: on-device-llm, android, kotlin-multiplatform, gemma

Seeing on-device: multimodal image input for local Gemma
https://urjalabs.in/blog/nativelm-multimodal-vision/
litertlm-kmp v0.2.4 added vision — attach an image and the local Gemma model reasons over it, on-device. Here's how image attachments flow through the engine, why we default to the CPU vision backend, and the model gotcha that bites you on init.
Published: Tue, 26 May 2026 00:00:00 GMT
Tags: on-device-llm, android, multimodal, vision, kotlin

Wrapping Google's LiteRT-LM into a Kotlin Multiplatform engine
https://urjalabs.in/blog/litertlm-kmp-engine-architecture/
The engine origin story: how litertlm-kmp turns Google's LiteRT-LM into a clean KMP library — four core abstractions, a resumable SHA-256 download manager, typed-Kotlin-to-OpenAPI function calling, and the thread discipline that keeps a non-thread-safe native runtime honest.
Published: Mon, 25 May 2026 00:00:00 GMT
Tags: on-device-llm, android, kotlin, architecture, kmp