Learning and implementing generative AI tools
LLM Fine-tuning:
- Ziegler, Daniel M., Nisan Stiennon, Jeffrey Wu, Tom B. Brown, Alec Radford, Dario Amodei, Paul Christiano, and Geoffrey Irving. "Fine-tuning language models from human preferences." arXiv preprint arXiv:1909.08593 (2019).
- Rafailov, Rafael, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, and Chelsea Finn. "Direct preference optimization: Your language model is secretly a reward model." arXiv preprint arXiv:2305.18290 (2023).
- Christiano, Paul F., Jan Leike, Tom Brown, Miljan Martic, Shane Legg, and Dario Amodei. "Deep reinforcement learning from human preferences." Advances in neural information processing systems 30 (2017).
- Bai, Yuntao, et al. "Constitutional AI: Harmlessness from AI feedback." arXiv preprint arXiv:2212.08073 (2022).
- EHR
- Clinical workflow, chart summarization.
- Genomics
- Informed Consent
- Research ethics-related documents.
- Mirza et al. "Using ChatGPT to facilitate truly informed medical consent." NEJM AI (2024).
- Tutorials
- Hugging Face