A collection of work I have done for the past while that I’m proud of.

A backlog of unfinished ideas can be found here.


writing.

You can find internal monologue under posts index.

open source.

  • Quartz - 🌱 a fast, batteries-included static-site generator that transforms Markdown content into fully functional websites (2023-)

  • avante.nvim - A Cursor-like chat IDE for Neovim (2024-)

  • OpenLLM - Run any open-source LLMs as OpenAI compatible API endpoint in the cloud. (2023-)

    • 🔬 Build for fast and production usages
    • 🚂 Support Llama, Qwen, Gemma, etc, and quantized versions
    • ⛓️ OpenAI-compatible API
    • 💬 Built-in ChatGPT like UI
    • 🔥 Accelerated LLM decoding with state-of-the-art inference backends
    • Source: GitHub
  • BentoML - Build Production-grade AI Application (2021-)

    • a framework that simplifies machine learning model deployment and provides a faster way to ship your model to production. Supports a variety of use cases, from classical ML to LLMs, diffusions models.
    • Built using Python, BuildKit, gRPC
    • Source: GitHub, Documentation
  • incogni.to - a pseudonymous event platform that curates for those yearning to be seen for who they are, not what they can “sell” (2024)

    • Implemented a RAG pipeline for recommendation system based on users preferences and interests, with command-r-plus-08-2024, deployed with vLLM and [@github.com/bentoml/BentoML].
    • Added semantic search to find relevant events based on query with Cohere Rerank
    • General UI implementation with shadcn/ui and vercel/next.js
    • Demoed at New Build’24
    • Source: stream, posts
  • onw - A real-time navigation tools for safer commute (2021)

    • Implemented route optimization, heat map visualization to identify hot zones, peer notification system.
    • Added a heuristic Gaussian Mixture Model to find the safest path between different locations, trained on past assault data provided by Toronto Police Department.
    • Awarded: Finalists at Hack the North 2021.
    • Built using AWS Fargate, React Native, TypeScript, GraphQL, Apache Spark MLlib, Google Maps API
    • Source: GitHub, devpost

talks.

  • OpenLLM, and everything about running LLMs in production at Hack The North (2023)

[^ref]