APC, SD, and SF

Explanation of Automatic Prefix Caching (APC), Speculative Decoding (SD), and Split Fuse (SF).

Jan-13-2025 · 2 min · 394 words · jamesnulliu

Create A LibTorch Project

How to create a LibTorch project.

Dec-23-2024 · 7 min · 1306 words · jamesnulliu

VSCode: Debug Python

This post shows how to configure launch.json in VSCode for debugging Python.

Oct-09-2024 · 1 min · 173 words · jamesnulliu

Dive into Paged Attention

Dive into the paged attention mechanism of vLLM.

Oct-07-2024 · 11 min · 5109 words · jamesnulliu

A Simple PyTorch Training Pipeline

How to build a simple PyTorch training pipeline.

Jun-30-2024 · 4 min · 714 words · jamesnulliu