osanseviero/hackerllama


utterances-bot opened this issue · 8 comments

hackerllama - The Llama Hitchiking Guide to Local LLMs

https://osanseviero.github.io/hackerllama/blog/posts/hitchhiker_guide/

This is amazing, thanks!

great job!

Great overview of the different concepts; I discovered many new ones! Thanks @osanseviero

Great job! Bookmarking this post.

Good stuff. It would be nice to have a deeper dive into embeddings and the tooling around them.

Good post! One comment: Flash Attention is not an approximation of attention; it is exact, computing the same result as standard attention. It achieves its speedup through optimized memory access (avoiding materializing the full attention matrix) and parallel processing.
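To make the "exact, not approximate" point concrete, here is a minimal sketch in plain PyTorch (not the actual FlashAttention kernel): attention computed block by block with a running softmax matches the naive computation to floating-point precision. The tensor shapes and block size are arbitrary, picked only for illustration.

```python
import torch

torch.manual_seed(0)
q = torch.randn(8, 64)   # (seq_len, head_dim), hypothetical sizes
k = torch.randn(8, 64)
v = torch.randn(8, 64)
scale = q.shape[-1] ** -0.5

# Naive attention: materializes the full (seq_len x seq_len) score matrix.
naive = torch.softmax(q @ k.T * scale, dim=-1) @ v

# Tiled attention: process keys/values in blocks, keeping running softmax
# statistics (per-row max and normalizer) so the full matrix is never stored.
block = 2
out = torch.zeros_like(q)
row_max = torch.full((q.shape[0], 1), float("-inf"))
row_sum = torch.zeros(q.shape[0], 1)
for start in range(0, k.shape[0], block):
    kb, vb = k[start:start + block], v[start:start + block]
    scores = q @ kb.T * scale                      # scores for this block only
    new_max = torch.maximum(row_max, scores.max(dim=-1, keepdim=True).values)
    correction = torch.exp(row_max - new_max)      # rescale earlier partial sums
    p = torch.exp(scores - new_max)
    row_sum = row_sum * correction + p.sum(dim=-1, keepdim=True)
    out = out * correction + p @ vb
    row_max = new_max
tiled = out / row_sum

print(torch.allclose(naive, tiled, atol=1e-5))  # True: identical result, no approximation
```

The speedup of the real kernel comes from doing this tiling inside fast on-chip SRAM and fusing the steps, not from changing the math.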

This is an incredibly useful article. Thank you @osanseviero for maintaining this.

Very helpful!