We need to talk about the 'magic' behind Claude’s CUDA kernels. Is it superior synthetic data, or did Anthropic find a better way to teach LLMs hardware-level logic? Open to all technical theories
Baleeshwar Palavadi
aim143
·
AI & ML interests
None yet
Recent Activity
commentedon an article about 1 month ago
We Got Claude to Build CUDA Kernels and teach open models! updated a dataset about 2 years ago
aim143/guanaco-llama2-500 liked a model about 2 years ago
aim143/tinystarcoder-rlhf-model