Svelte Hacker News logo
  • top
  • new
  • show
  • ask
  • jobs
  • about

Show HN: Pure CUDA C Inference for Qwen3 0.6B in One File, No Dependencies

github.com

1 points by yb0000 17 hours ago

yb0000 17 hours ago

[dead]