def llm_key_terms():
    """Return a glossary mapping common LLM terms to short definitions."""
    terms = {
        'tokens': 'units of text (words, subwords, or characters) that a model processes',
        'embedding': 'a mapping that converts tokens into vectors',
        'vectors': 'numerical representations of tokens',
        'vectorstore': 'a database that stores vectors and supports similarity search',
        'transformers': 'the neural network architecture used in most LLMs',
        'pre-training': 'training a model on a large dataset of text',
        'fine tuning': 'further training a pre-trained model on a specific task',
        'RAG': 'retrieval-augmented generation: a prompting method that retrieves relevant text and adds it to the prompt as context',
    }
    return terms
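To show how several of the terms above fit together, here is a minimal sketch that chains tokens, embeddings, a vectorstore, and RAG using only the standard library. Everything in it is an assumption made for illustration: `tokenize`, `embed`, `ToyVectorStore`, and `build_rag_prompt` are hypothetical names, the whitespace tokenizer and bag-of-words vectors stand in for the subword tokenizers and learned embeddings that real LLMs use, and the in-memory list stands in for a real vector database.

```python
import math
from collections import Counter


def tokenize(text):
    """Split text into lowercase word tokens (real LLMs use subword tokenizers)."""
    return text.lower().split()


def embed(tokens, vocab):
    """Toy embedding: a bag-of-words count vector over a fixed vocabulary."""
    counts = Counter(tokens)
    return [float(counts[word]) for word in vocab]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class ToyVectorStore:
    """Hypothetical in-memory vectorstore: keeps documents alongside their vectors."""

    def __init__(self, documents):
        self.documents = documents
        # The vocabulary is every distinct token seen in the stored documents.
        self.vocab = sorted({tok for doc in documents for tok in tokenize(doc)})
        self.vectors = [embed(tokenize(doc), self.vocab) for doc in documents]

    def search(self, query, k=1):
        """Return the k documents most similar to the query."""
        query_vec = embed(tokenize(query), self.vocab)
        ranked = sorted(
            zip(self.documents, self.vectors),
            key=lambda pair: cosine(query_vec, pair[1]),
            reverse=True,
        )
        return [doc for doc, _ in ranked[:k]]


def build_rag_prompt(question, store, k=1):
    """RAG step: retrieve relevant text and prepend it to the prompt as context."""
    context = "\n".join(store.search(question, k))
    return f"Context:\n{context}\n\nQuestion: {question}"


docs = [
    "tokens are units of text",
    "a vectorstore is a database that stores vectors",
    "fine tuning adapts a pre-trained model to a specific task",
]
store = ToyVectorStore(docs)
prompt = build_rag_prompt("what does a vectorstore store", store)
print(prompt)
```

In a real system the resulting prompt would be sent to an LLM; that call is left out here because it depends on the provider. Count-based vectors only match exact words, which is the main limitation that learned embeddings are meant to address.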
- Attention Is All You Need (paper): https://arxiv.org/abs/1706.03762
- good article: https://thelowdown.momentum.asia/the-emergence-of-large-language-models-llms/
- excellent video: https://www.youtube.com/watch?v=osKyvYJ3PRM
- great read by Elastic: https://www.elastic.co/what-is/large-language-models
- Elasticsearch and LangChain: https://www.elastic.co/search-labs/blog/large-language-models-elastic-code-langchain
- Demystifying LLMs: https://github.blog/2023-10-27-demystifying-llms-how-they-can-do-things-they-werent-trained-to-do/
- AI for Beginners repo: https://github.com/microsoft/AI-For-Beginners
- AI, ML, DL, GenAI: https://synoptek.com/insights/it-blogs/data-insights/ai-ml-dl-and-generative-ai-face-off-a-comparative-analysis/
- LinkedIn post: https://www.linkedin.com/posts/srikanth-reddy-98196712_ai-mldl-what-is-generative-ai-what-activity-7125167272247115777-Nobw/
- How LLMs work video: https://www.youtube.com/watch?v=5sLYAQS9sWQ
- Gen AI in a nutshell: https://www.youtube.com/watch?v=2IK3DFHRFfw
- How GitHub Copilot handles data: https://resources.github.com/learn/pathways/copilot/essentials/how-github-copilot-handles-data/
- Top Open LLMs: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
- The LLM Journey: https://www.youtube.com/watch?v=jM72J9wPQQ0
- Story of Language Models: https://www.assemblyai.com/blog/the-full-story-of-large-language-models-and-rlhf/
- A Practical intro to LLMs (very good video!): https://www.youtube.com/watch?v=tFHeUSJAYbE
- build an LLM from scratch repo: https://github.com/rasbt/LLMs-from-scratch
- what is RAG: https://github.blog/2024-04-04-what-is-retrieval-augmented-generation-and-what-does-it-do-for-generative-ai/
- fine tuning LLMs: https://github.blog/2024-02-28-customizing-and-fine-tuning-llms-what-you-need-to-know/
- using GitHub Copilot: https://github.blog/2024-03-25-how-to-use-github-copilot-in-your-ide-tips-tricks-and-best-practices/
- visualize tokens: https://platform.openai.com/tokenizer