Show HN: Pure CUDA C Inference for Qwen3 0.6B in One File, No Dependencies (github.com)
1 point by yb0000 17 hours ago
[dead]