Stephen Carmody

A place to write about AI topics and ML in production

Posts Interesting Papers About

Blog Posts

Most Recent Posts

    Vector Databases Explained: Search in era of AI - 16 March 2025

2024

    June 2024

    A Guide to LLM Inference (Part 4): Speculative Decoding & Batching - 2 June 2024

    May 2024

    A Guide to LLM Inference (Part 3): Model Compression - 19 May 2024

    A Guide to LLM Inference (Part 2): Attention Optimisation - 5 May 2024

    April 2024

    A Guide to LLM Inference (Part 1): Foundations - 21 April 2024

    A brief Introduction to LLMOps - 7 April 2024

    March 2024

    Fine-Tuning Pre Trained Models - 24 March 2024

    An Introduction to the Transformer Architecture (Part 2) - 10 March 2024

    February 2024

    An Introduction to the Transformer Architecture (Part 1) - 25 February 2024

    Hello World - 11 February 2024

Oldest Posts