
I would like to think of the work I do as craft, a celebration of the act of making itself.

There is also a backlog of unfinished ideas that I might work on, one day.


writing.

You can find my internal monologue under the posts index.

open source.

  • Quartz - 🌱 a fast, batteries-included static-site generator that transforms Markdown content into fully functional websites (2023-)

  • avante.nvim - 🎓 A Cursor-like chat IDE for Neovim (2024-)

  • tinymorph - ✍️ An exploration into how we build interfaces for machine-assisted writing tools (2024-)

  • OpenLLM - ⚙️ Run any open-source LLM as an OpenAI-compatible API endpoint in the cloud. (2023-)

    • 🔬 Built for fast, production use
    • 🚂 Support Llama, Qwen, Gemma, etc, and quantized versions
    • ⛓️ OpenAI-compatible API
    • 💬 Built-in ChatGPT-like UI
    • 🔥 Accelerated LLM decoding with state-of-the-art inference backends
    • Source: github
  • BentoML - 🍱 Build production-grade AI applications (2021-) (Yang et al., n.d.)

    • A framework that simplifies machine learning model deployment and provides a faster way to ship your model to production. Supports a variety of use cases, from classical ML to LLMs and diffusion models.
    • Built using Python, BuildKit, gRPC
    • Source: github, docs
  • incogni.to - 🌗 an event platform that curates for those yearning to be seen for who they are, not what they can "sell" (2024)

  • onw - A real-time navigation tool for safer commutes (2021)

    • Implemented route optimization, heat map visualization to identify hot zones, and a peer notification system.
    • Added a heuristic Gaussian Mixture Model, trained on past assault data provided by the Toronto Police Department, to find the safest path between locations.
    • Awarded: Finalist at Hack the North 2021.
    • Built using AWS Fargate, React Native, TypeScript, GraphQL, Apache Spark MLlib, Google Maps API
    • Source: github, devpost

talks.

  • OpenLLM, and everything about running LLMs in production, at Hack the North (2023)

companies.

Bibliography

  • Templeton, A., Conerly, T., Marcus, J., Lindsey, J., Bricken, T., Chen, B., Pearce, A., Citro, C., Ameisen, E., Jones, A., Cunningham, H., Turner, N. L., McDougall, C., MacDiarmid, M., Freeman, C. D., Sumers, T. R., Rees, E., Batson, J., Jermyn, A., … Henighan, T. (2024). Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet. Transformer Circuits Thread. [link]
  • Yang, C., Sean, S., Aaron, P., Shenyang, Z., Sauyon, L., Bo, J., Fog, D., Xipeng, G., & Frost, M. (n.d.). BentoML: The framework for building reliable, scalable and cost-efficient AI application. [GitHub]