This repository organizes a timeline of key events (products, services, papers, GitHub, blog posts and news) that occurred before and after the ChatGPT announcement.
It's curating a variety of information in this timeline, with a particular focus on LLM and Generative AI.
Maybe it's a scene from the hottest history, so I thought it would be important to keep those memories well, so I organized them.
Issues and Pull Requests are greatly appreciated. If you've never contributed to an open source project before I'm more than happy to walk you through how to create a pull request.
You can start by opening an issue describing the problem that you're looking to resolve and we'll go from there.
This document is licensed under the MIT license © Jonghong Jeon
Date | Announcement |
---|---|
5.28 | Introducing NVIDIA ACE For Games - Spark Life Into Virtual Characters With Generative AI (blog) |
5.27 | WingmanAI - real-time transcription of audio, integrated with ChatGPT for interactive use (GitHub) |
5.27 | ToolBench - Large-scale instruction tuning SFT data to equip LLMs with general tool-use capability (GitHub) |
5.27 | G7 officials to hold first meeting on AI regulation next week (news) |
5.26 | Backpack Language Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.26 | Impossible Distillation: from Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.26 | Playing repeated games with Large Language Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.26 | Training Socially Aligned Language Models in Simulated Human Society (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.26 | BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks (arXiv), (PDF), (arXiv-vanity), (paper page) |
5.26 | Large Language Models as Tool Makers (arXiv), (PDF), (arXiv-vanity), (paper page) |
5.26 | ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (project page) |
5.25 | Role-Play with Large Language Models (arXiv), (PDF), (arXiv-vanity), (paper page) |
5.25 | Break-A-Scene: Extracting Multiple Concepts from a Single Image (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.25 | Voyager: An Open-Ended Embodied Agent with Large Language Models (Project page), (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (GitHub), (MindDojo) |
5.25 | Efficient Neural Music Generation (arXiv), (PDF), (arXiv-vanity), (paper page) |
5.25 | Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.25 | On Architectural Compression of Text-to-Image Diffusion Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.25 | Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (GitHub) |
5.25 | The False Promise of Imitating Proprietary LLMs (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.25 | the new Stable Diffusion “Reimagine XL” model on @ClipdropApp x @StabilityAI (tweet), (Clipdrop) |
5.25 | Gorilla: Large Language Model Connected with Massive APIs (tweet), (project page), (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (GitHub), (demo video), (discord) |
5.25 | OpenAI - Democratic Inputs to AI (Tweet), (Blog) |
5.24 | Think Before You Act: Decision Transformers with Internal Working Memory (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.24 | PandaGPT: One Model to Instruction-Follow Them All (project page), (PDF), (demo), (video), (dataset), (model), (GitHub), (tweet) |
5.24 | SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.24 | Manifold Diffusion Fields (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.24 | A Neural Space-Time Representation for Text-to-Image Personalization (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (GitHub) |
5.24 | Can Transformers Learn to Solve Problems Recursively? (arXiv), (PDF), (arXiv-vanity), (paper page) |
5.24 | This Land is {Your, My} Land: Evaluating Geopolitical Biases in Language Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.24 | Model evaluation for extreme risks (arXiv), (PDF), (arXiv-vanity), (paper page) |
5.24 | State of GPT and RLHF LLMs - Andrej Karpathy, OpenAI (session), (video) |
5.24 | LMs with a Voice: Spoken Language Modeling beyond Speech Tokens (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (GitHub) |
5.24 | BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (GitHub), (Project page) |
5.23 | OlaGPT: Empowering LLMs With Human-like Problem-Solving Abilities (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.23 | Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.23 | Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.23 | Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.23 | Aligning Large Language Models through Synthetic Feedback (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.23 | LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (GitHub) |
5.23 | Lost in Translation: Large Language Models in Non-English Content Analysis (news) |
5.23 | Anchor Prediction: Automatic Refinement of Internet Links (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.23 | Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.23 | Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.23 | PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.23 | Bing at Microsoft Build 2023: Continuing the Transformation of Search (blog) |
5.23 | Bringing the power of AI to Windows 11 – unlocking a new era of productivity for customers and developers with Windows Copilot and Dev Home (blog) |
5.23 | Adobe Unveils Future of Creative Cloud With Generative AI as a Creative Co-Pilot in Photoshop (news), (blog) |
5.23 | QLoRA: Efficient Finetuning of Quantized LLMs (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (GitHub) |
5.22 | SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation (arXiv), (PDF), (arXiv-vanity), (papers with code) |
5.22 | Meta-in-context learning in large language models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.22 | AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation (arXiv), (PDF), (arXiv-vanity), (papers with code), (GitHub) |
5.22 | Iterative Forward Tuning Boosts In-context Learning in Language Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.22 | How Language Model Hallucinations Can Snowball (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (demo) |
5.22 | Intel Announces Aurora genAI, Generative AI Model With 1 Trillion Parameters (news), (Intel newsroom) |
5.22 | Introducing Mind-Video (Tweet), (demo), (data) |
5.22 | Reflective Linguistic Programming (RLP): A Stepping Stone in Socially-Aware AGI (SocialAGI) (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.22 | GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.22 | LM vs LM: Detecting Factual Errors via Cross Examination (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.22 | XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages (PDF), (GitHub) |
5.22 | VideoLLM: Modeling Video Sequence with Large Language Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.22 | RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.22 | RWKV: Reinventing RNNs for the Transformer Era (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.22 | Introducing speech-to-text, text-to-speech, and more for 1,100+ languages (Blog), (PDF), (GitHub) |
5.21 | Augmenting Autotelic Agents with Large Language Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.21 | XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models (GitHub), (Video) |
5.20 | G7 Hiroshima Leaders’ Communiqué (statement), (html) |
5.20 | G7 calls for developing global technical standards for AI (news) |
5.20 | Labour should pledge £11bn to build ‘BritGPT’ AI, thinktank says (news) |
5.20 | CodeCompose: A Large-Scale Industrial Deployment of AI-assisted Code Authoring (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | Clinical Camel: An Open-Source Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (huggingface), (GitHub) |
5.19 | New York City public schools remove ChatGPT ban (news) |
5.19 | Graphologue: Exploring Large Language Model Responses with Interactive Diagrams (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | The Inside Story: Towards Better Understanding of Machine Translation Neural Evaluation Metrics (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (GitHub) |
5.19 | HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | Characterizing tradeoffs between teaching via language and demonstrations in multi-agent systems (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | Comparing Software Developers with ChatGPT: An Empirical Investigation (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (GitHub) |
5.19 | Multimodal Web Navigation with Instruction-Finetuned Foundation Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | Scaling laws for language encoding models in fMRI (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | Any-to-Any Generation via Composable Diffusion (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (GitHub) |
5.19 | ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.19 | Apple Bans Employees From Using ChatGPT Amid Its Own AI Efforts (news) |
5.18 | Brain-inspired learning in artificial neural networks: a review (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.18 | ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.18 | RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.18 | LIMA: Less Is More for Alignment (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.18 | GETMusic: Generating Any Music Tracks with a Unified Representation and Diffusion Framework (arXiv), (PDF), (arXiv-vanity), (paper page), (project page), (papers with code), (GitHub), (Star history) |
5.18 | SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.18 | mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.18 | Language Models Meet World Models: Embodied Experiences Enhance Language Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.18 | Roundhill Investments Launches Generative AI & Technology ETF (NYSE Arca: CHAT) (news), (CHAT ETF) |
5.18 | VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.18 | Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (Huggingfae), (Unofficial), (colab), (Official) |
5.18 | PyLLMs - a minimal Python library to connect to LLMs (OpenAI, Anthropic, Google, AI21, Cohere, Aleph Alpha, HuggingfaceHub) (GitHub) |
5.18 | Evidence of Meaning in Language Models Trained on Programs (arXiv), (PDF), (arXiv-vanity), (papers with code) |
5.18 | Introducing the ChatGPT app for iOS (blog), (Download on the App Stor) |
5.18 | MTIA v1: Meta’s first-generation AI inference accelerator (blog) |
5.18 | Pursuing groundbreaking scale and accelerating research using Meta’s Research SuperCluster (blog) |
5.18 | Reimagining Meta’s infrastructure for the AI age (blog) |
5.17 | Chain-of-Symbol Prompting Elicits Planning in Large Langauge Models (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.17 | DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
5.17 | Explaining black box text modules in natural language with language models (arXiv), (PDF), (arXiv-vanity), (paper page), (project page), (papers with code |
5.17 | Tree of Thoughts: Deliberate Problem Solving with Large Language Models (arXiv), (PDF), (arXiv-vanity), (papers with code) |
5.17 | Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback (arXiv), (PDF), (arXiv-vanity), (papers with code) |
5.17 | PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering (arXiv), (PDF), (arXiv-vanity), (🏆papers with code) |
5.17 | What You See is What You Read? Improving Text-Image Alignment Evaluation (arXiv), (PDF), (arXiv-vanity), (papers with code) |
5.17 | PaLM 2 Technical Report (arXiv), (PDF), (arXiv-vanity), (🏆papers with code) |
5.17 | Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback (arXiv), (PDF), (arXiv-vanity), (papers with code) |
5.17 | SoundStorm: Efficient Parallel Audio Generation (arXiv), (PDF), (arXiv-vanity), (Project page), (papers with code) |
5.16 | AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation (arXiv), (PDF), (arXiv-vanity) |
5.16 | Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation (arXiv), (PDF), (arXiv-vanity) |
5.16 | ChatGPT versus human in generating medical graduate exam questions – An international prospective study (medRxiv), (PDF) |
5.16 | Understanding 3D Object Interaction from a Single Image (arXiv), (PDF), (arXiv-vanity), (project page), (demo), (video), (GitHub) |
5.16 | StructGPT: A General Framework for Large Language Model to Reason over Structured Data (arXiv), (PDF), (arXiv-vanity), (papers with code) |
5.16 | FitMe: Deep Photorealistic 3D Morphable Model Avatars (arXiv), (PDF), (arXiv-vanity), (project page) |
5.16 | Pre-Training to Learn in Context (arXiv), (PDF), (arXiv-vanity), (papers with code) |
5.16 | Towards Expert-Level Medical Question Answering with Large Language Models (arXiv), (PDF), (arXiv-vanity), (🏆papers with code) |
5.16 | GPTeam: Collaborative AI Agents (GitHub) |
5.16 | WATCH LIVE: OpenAI CEO Sam Altman testifies on artificial intelligence before Senate committee (Youtube) |
5.16 | NYT - Microsoft Says New A.I. Shows Signs of Human Reasoning |
5.15 | Common Diffusion Noise Schedules and Sample Steps are Flawed (arXiv), (PDF), (arXiv-vanity) |
5.15 | Symbol tuning improves in-context learning in language models (arXiv), (PDF), (arXiv-vanity) |
5.15 | Interpretability at Scale: Identifying Causal Mechanisms in Alpaca (arXiv), (PDF), (arXiv-vanity) |
5.15 | DarkBERT: A Language Model for the Dark Side of the Internet (arXiv), (PDF), (arXiv-vanity) |
5.15 | AutoRecon: Automated 3D Object Discovery and Reconstruction (arXiv), (PDF), (arXiv-vanity), (Project page) |
5.15 | RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs (arXiv), (PDF), (arXiv-vanity), (papers with code) |
5.15 | Small Models are Valuable Plug-ins for Large Language Models (arXiv), (PDF), (arXiv-vanity) |
5.15 | "ChatGPT can pick stocks better then top fund managers" - The ChatGPT Fund - (tweet), (website) |
5.15 | officially launching the Poe API - (Tweet, (GitHub): (poe-protocol), (api-bot-tutorial) |
5.15 | Guidance - A guidance language for controlling large language models (GitHub) |
5.15 | BriefGPT - Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI (GitHub) |
5.15 | I’m an ER doctor. Here’s how I’m already using ChatGPT to help treat patients (blog) |
5.14 | How to run Llama 13B with a 6GB graphics card (Gist) |
5.13 | Leaked Copilot Chat's confidential rules (tweet) |
5.13 | GPT-Sentinel: Distinguishing Human and ChatGPT Generated Content (arXiv](https://arxiv.org/abs/2305.07969)), (PDF), (arXiv-vanity) |
5.13 | Everything-LLMs-And-Robotics - The world's largest GitHub Repository for LLMs + Robotics (GitHub) |
5.13 | CodeT5+: Open Code Large Language Models for Code Understanding and Generation arXiv), (PDF), (arXiv-vanity), (GitHub), (🏆papers with code) |
5.13 | EU AI Act To Target US Open Source Software (Blog) |
5.13 | PCAST Working Group on Generative AI Invites Public Input (Blog) |
5.12 | spacy-llm, an extension for integrating LLMs into structured NLP pipelines! (GitHub), (tweet) |
5.12 | TinyStories: How Small Can Language Models Be and Still Speak Coherent English? (arXiv), (PDF), (arXiv-vanity) |
5.12 | Dr. LLaMA: Improving Small Language Models in Domain-Specific QA via Generative Data Augmentation (arXiv), (PDF), (arXiv-vanity), (model), (GitHub) |
5.12 | ArtGPT-4: Artistic Vision-Language Understanding with Adapter-enhanced MiniGPT-4 (arXiv), (PDF), (arXiv-vanity), (model) |
5.12 | MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers (arXiv), (PDF), (arXiv-vanity) |
5.12 | AI FILM -The Carnival of the Ages - Runway gen2 (Youtube), (Reddit) |
5.11 | Large Language Models Can Be Used To Effectively Scale Spear Phishing Campaigns (arXiv), (PDF), (arXiv-vanity) |
5.11 | Towards best practices in AGI safety and governance: A survey of expert opinion (arXiv), (PDF), (arXiv-vanity) |
5.11 | Optimizing Memory Mapping Using Deep Reinforcement Learning (arXiv), (PDF), (arXiv-vanity) |
5.11 | Universal Source Separation with Weakly Labelled Data (arXiv), (PDF), (arXiv-vanity), (GitHub) |
5.11 | Active Retrieval Augmented Generation (arXiv), (PDF), (arXiv-vanity), (GitHub) |
5.11 | Anthropic - Introducing 100K Context Windows (Blog) |
5.11 | CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model (arXiv), (PDF), (arXiv-vanity) |
5.11 | Exploiting Diffusion Prior for Real-World Image Super-Resolution (arXiv), (PDF), (arXiv-vanity), (Project page) |
5.11 | Domain Incremental Lifelong Learning in an Open World (arXiv), (PDF), (arXiv-vanity) |
5.11 | Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting (arXiv), (PDF), (arXiv-vanity) |
5.11 | Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers (arXiv), (PDF), (arXiv-vanity) |
5.11 | EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention (arXiv), (PDF), (arXiv-vanity), (GitHub) |
5.11 | InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning (arXiv), (PDF), (arXiv-vanity), (GitHub) |
5.11 | Huggingface Transformers Agent (API) |
5.11 | Google PaLM 2 Technical Report (PDF), (Blog) |
5.11 | Google MusicLM (Demo), (news) |
5.10 | HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion (arXiv), (PDF), (arXiv-vanity) |
5.10 | VideoChat: Chat-Centric Video Understanding (arXiv), (PDF), (arXiv-vanity) |
5.10 | Bot or Human? Detecting ChatGPT Imposters with A Single Question (arXiv), (PDF), (arXiv-vanity) |
5.10 | Do LLMs Understand User Preferences? Evaluating LLMs On User Rating Prediction (arXiv), (PDF), (arXiv-vanity) |
5.10 | Relightify: Relightable 3D Faces from a Single Image via Diffusion Models (arXiv), (PDF), (arXiv-vanity) |
5.10 | Similarity of Neural Network Models: A Survey of Functional and Representational Measures (arXiv), (PDF), (arXiv-vanity) |
5.10 | Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era (arXiv), (PDF), (arXiv-vanity) |
5.10 | MPT-7B StoryWriter- new open-source language model that can handle really long inputs (Replicate) |
5.10 | Humata.ai - Ask AI anything about your files (Tweet) |
5.10 | IMAGEBIND: One Embedding Space To Bind Them All (PDF), (Blog), (GitHub), (🏆papers with code), (star history) |
5.9 | StarCoder: may the source be with you! (arXiv), (PDF), (arXiv-vanity), (Paper page) |
5.9 | Towards Building the Federated GPT: Federated Instruction Tuning (arXiv), (PDF), (arXiv-vanity), (GitHub) |
5.9 | Large Language Model Programs (arXiv), (PDF), (arXiv-vanity) |
5.9 | FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance (arXiv), (PDF), (arXiv-vanity) |
5.9 | OpenAI - Language models can explain neurons in language models (Blog), (Paper), (GitHub), (Tweet) |
5.9 | AvatarReX: Real-time Expressive Full-body Avatars (arXiv), (PDF), (arXiv-vanity) |
5.8 | Augmented Large Language Models with Parametric Knowledge Guiding (arXiv), (PDF), (arXiv-vanity) |
5.8 | We had ChatGPT take the CPA exam — and it failed (news) |
5.8 | Comparison of GPT-3.5, GPT-4, and human user performance on a practice ophthalmology written examination (Nature) |
5.8 | MultiModal-GPT: A Vision and Language Model for Dialogue with Humans (arXiv), (PDF), (arXiv-vanity), (GitHub), (Paper page), (Star history) |
5.7 | Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models (arXiv), (PDF), (arXiv-vanity), (GitHub) |
5.7 | X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages (arXiv), (PDF), (arXiv-vanity) |
5.7 | Multi-Space Neural Radiance Fields (arXiv), (PDF), (arXiv-vanity), (Project page), (Dataset) |
5.7 | Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting (arXiv), (PDF), (arXiv-vanity) |
5.7 | Yoshua Bengio - AI Scientists: Safe and Useful AI? (Blog) |
5.5 | privateGPT - Interact privately with your documents using the power of GPT, 100% privately, no data leaks (GitHub), (star history) |
5.5 | Open LLMs : A list of open LLMs available for commercial use - (GitHub) |
5.5 | A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding (arXiv), (PDF), (arXiv-vanity), (Paper page) |
5.5 | Otter: A Multi-Modal Model with In-Context Instruction Tuning (arXiv), (PDF), (arXiv-vanity), (GitHub), (Paper page) |
5.5 | Composite Motion Learning with Task Control (arXiv), (PDF), (arXiv-vanity), (GitHub), (Papper page) |
5.5 | StarCoderBase: trained on 1T tokens in 80+ programming languages (Huggingface) |
5.5 | Dolphin: General video interaction platform based on LLMs (Demo), (GitHub), (Tweet) |
5.5 | MPT-7B: A New Standard for Open-Source, Commercially Usable LLMs (Blog), Commercially usable: (MPT-7B) (MPT-7B-Instruct), (MPT-7B-StoryWriter), For non-commerical use: (MPT-7B-Chat) |
5.5 | StarCoder: A State-of-the-Art LLM for Code (Blog), (GitHub), (HuggingFace), (Tweet) |
5.5 | OpenAlpaca, an instruction-following model based on OpenLLaMA (GitHub), (Huggingface), (Tweet) |
5.4 | Seeing is Believing: Brain-Inspired Modular Training for Mechanistic Interpretability (arXiv), (PDF), (arXiv-vanity), (Github), (demo), (Papper page) |
5.4 | Evaluating the Performance of ChatGPT in Ophthalmology: An Analysis of its Successes and Shortcomings (Ophthalmology Science) |
5.4 | Cognitive Reframing of Negative Thoughts through Human-Language Model Interaction (arXiv), (PDF), (arXiv-vanity) |
5.4 | Governance of the AI, by the AI, and for the AI (arXiv), (PDF), (arXiv-vanity), (Papper page) |
5.4 | Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs (arXiv), (PDF), (arXiv-vanity) |
5.4 | Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion (arXiv), (PDF), (arXiv-vanity), (Papper page) |
5.4 | AttentionViz: A Global View of Transformer Attention (arXiv), (PDF), (arXiv-vanity), (Papper page) |
5.4 | Reddit - OpenAI lost $540M in 2022, will need $100B more to develop AGI, says Altman. My breakdown on why this matters and what it means for other AI startups |
5.4 | FACT SHEET: Biden-Harris Administration Announces New Actions to Promote Responsible AI Innovation that Protects Americans’ Rights and Safety - (White house) |
5.4 | Google "We Have No Moat, And Neither Does OpenAI" - (Blog) |
5.4 | CNBC - Britain launches probe into ChatGPT-style A.I. as regulators grow concerned by risks |
5.4 | Personalize Segment Anything Model with One Shot (arXiv), (PDF), (arXiv-vanity), (GitHub), (Paper page) |
5.4 | AutoML-GPT: Automatic Machine Learning with GPT (arXiv), (PDF), (arXiv-vanity), (Paper page) |
5.4 | NeRSemble: Multi-view Radiance Field Reconstruction of Human Heads (arXiv), (PDF), (arXiv-vanity, (Project page), (Paper page) |
5.4 | An automatically discovered chain-of-thought prompt generalizes to novel models and datasets (arXiv), (PDF), (arXiv-vanity) |
5.4 | NYT - White House Pushes Tech C.E.O.s to Limit Risks of A.I. |
5.4 | Microsoft Bing AI chatbot and Edge browser get massive AI upgrades. See the list. (Blog) |
5.4 | Introducing Slack GPT (Blog) |
5.3 | Chatbot Arena: Benchmarking LLMs in the Wild with Elo Ratings - (Blog) |
5.3 | CodeGen2: Lessons for Training LLMs on Programming and Natural Languages (arXiv), (PDF), (arXiv-vanity), (GitHub) |
5.3 | Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes (arXiv), (PDF), (arXiv-vanity) |
5.3 | Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings (arXiv), (PDF), (arXiv-vanity) |
5.3 | AG3D: Learning to Generate 3D Avatars from 2D Image Collections (arXiv), (PDF), (arXiv-vanity), (Project page) |
5.3 | Shap-E: Generating Conditional 3D Implicit Functions (arXiv), (PDF), (arXiv-vanity), (GitHub), (Paper page) |
5.3 | 100 Practical Applications and Use Cases of Generative AI - (PDF), (News) |
5.3 | Comprehensive LLM model zoo - Ecosystem Graphs to track the foundation model ecosystem assets (datasets, models, and applications) and their relationship (Table), (Graph), (GitHub) |
5.3 | GPTutor: a ChatGPT-powered programming tool for code explanation (arXiv), (PDF), (arXiv-vanity) |
5.3 | Midjourney 5.1 Arrives - And It’s Another Leap Forward For AI Art - (Forbes) |
5.3 | Mojo 🔥 — a new programming language for all AI developers (Web), (tweet), (GitHub) |
5.3 | #NeurIPS2023 Creative AI Track (Blog), (Call for proposal) |
5.3 | HeyPi - Personal AI |
5.2 | Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl (arXiv), (PDF), (arXiv-vanity) |
5.2 | Andrew Ng - ChatGPT Prompt Engineering for Developers - (online course), (Tweet) |
5.2 | DreamPaint: Few-Shot Inpainting of E-Commerce Items for Virtual Try-On without 3D Modeling (arXiv), (PDF), (arXiv-vanity) |
5.2 | Generalizing Dataset Distillation via Deep Generative Prior (arXiv), (PDF), (arXiv-vanity) |
5.2 | Multimodal Procedural Planning via Dual Text-Image Prompting (arXiv), (PDF), (arXiv-vanity), (GitHub) |
5.2 | WSJ - Google DeepMind CEO Says Some Form of AGI Possible in a Few Years |
5.2 | Latest NVIDIA Graphics Research Advances Generative AI’s Next Frontier (Blog) |
5.2 | Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation (arXiv), (PDF), (arXiv-vanity), (GitHub) |
5.2 | TMR: Text-to-Motion Retrieval Using Contrastive 3D Human Motion Synthesis (arXiv), (PDF), (arXiv-vanity), (Project page), (Demo) |
5.2 | Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation (arXiv), (PDF), (arXiv-vanity), (GitHub) |
5.2 | Unlimiformer: Long-Range Transformers with Unlimited Length Input (arXiv), (PDF), (arXiv-vanity) |
5.2 | Bark - Text-Prompted Generative Audio Model (GitHub) |
5.2 | Jsonformer: A Bulletproof Way to Generate Structured JSON from Language Models (GitHub) |
5.1 | scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI (bioXiv), (PDF) |
5.1 | The Guardian - AI makes non-invasive mind-reading possible by turning thoughts into text |
5.1 | Learning to Reason and Memorize with Self-Notes (arXiv), (PDF), (arXiv-vanity) |
5.1 | Poisoning Language Models During Instruction Tuning (arXiv), (PDF), (arXiv-vanity) |
5.1 | What Do Self-Supervised Vision Transformers Learn? (arXiv), (PDF), (arXiv-vanity) |
5.1 | NYT - ‘The Godfather of A.I.’ Leaves Google and Warns of Danger Ahead (Archive) |
4.30 | ChatGPT: Is this version good for healthcare and research? - (ScienceDirect) |
4.30 | Understanding Parameter-Efficient LLM Finetuning: Prompt Tuning And Prefix Tuning (Blog) |
4.30 | A brief history of LLaMA models (Blog) |
4.30 | BabyBeeAGI: Task Management and Functionality Expansion on top of BabyAGI (blog), (Replit), (GitHub), (OG BaybyAGI) |
4.30 | Results of G7 Digital and Tech Ministers’ Meeting in Takasaki, Gunma - (Summary), (Declaration), (Annex1), (Annex2), (Annex3), (Annex4), (Annex5) |
4.30 | PandaLM: Reproducible and Automated Language Model Assessment (GitHub) |
4.29 | Can ChatGPT Pass An Introductory Level Functional Language Programming Course? (arXiv), (PDF), (arXiv-vanity) |
4.29 | A Review of ChatGPT Applications in Education, Marketing, Software Engineering, and Healthcare: Benefits, Drawbacks, and Research Directions (arXiv), (PDF), (arXiv-vanity) |
4.29 | ChatGPT-2D, which can generate mind maps with AI - (Tweet), (ChatGPT-2D) |
4.29 | MLC LLM - an open framework that brings language models (LLMs) directly into a broad class of platforms (CUDA, Vulkan, Metal) with GPU acceleration (Tweet), (Demo), (GitHub) |
4.29 | GenOs Index - The April (aka the Frenetic Pace) Edition - (blog) |
4.29 | StableVicuna, the AI World’s First Open Source RLHF LLM Chatbot! - (Blog), (Tweet) |
4.29 | DeepFloyd - a state-of-the-art text-to-image model (Web), (GitHub), (HuggingFace demo), (Tweet) |
4.29 | When Patient Questions Are Answered With Higher Quality and Empathy by ChatGPT than Physicians - (Blog) |
4.29 | BMTools - Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins (GitHub) |
4.29 | FastChat-T5 (GitHub), (Tweet) |
4.29 | Lamini, the LLM Engine for Rapidly Customizing Models - (Blog) |
4.28 | EU proposes new copyright rules for generative AI - (Reuter), (Economic times) |
4.28 | PROMPTENGINEERING FORCHATGPTA QUICKGUIDE TOTECHNIQUES, TIPS,ANDBESTPRACTICES - (PDF) |
4.28 | ResiDual: Transformer with Dual Residual Connections (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.28 | Causal Reasoning and Large Language Models: Opening a New Frontier for Causality (arXiv), (PDF), (arXiv-vanity) |
4.28 | We Interviewed the Engineer Google Fired for Saying Its AI Had Come to Life (Futurism) |
4.28 | LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.28 | MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks (arXiv), (PDF), (arXiv-vanity) |
4.28 | Are Emergent Abilities of Large Language Models a Mirage? (arXiv), (PDF), (arXiv-vanity) |
4.28 | The Ultimate Battle of Language Models: Lit-LLaMA vs GPT3.5 vs Bloom vs …. (Blog) |
4.28 | Otter, a multi-modal in-context learning model with instruction tuning - (GitHub), (Demo), (Youtube) |
4.28 | Economist - Yuval Noah Harari argues that AI has hacked the operating system of human civilisation (Archive) |
4.28 | Assessing the Potential of USMLE-Like Exam Questions Generated by GPT-4 (medRxiv), (PDF) |
4.28 | JAMA - Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum - (paper) |
4.27 | ChatGPT as an Attack Tool: Stealthy Textual Backdoor Attack via Blackbox Generative Model Trigger (arXiv), (PDF), (arXiv-vanity) |
4.27 | PMC-LLaMA: Further Finetuning LLaMA on Medical Papers (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.27 | "Can ChatGPT Diagnose Me?" How Large Language Models will Transform Clinical Care - (Youtube) |
4.27 | Large Language Models Are State-of-the-Art Evaluators of Code Generation (arXiv), (PDF), (arXiv-vanity) |
4.27 | Controlled Text Generation with Natural Language Instructions (arXiv), (PDF), (arXiv-vanity) |
4.27 | A Survey of Large Language Models - version 8 (arXiv), (PDF), (arXiv-vanity) |
4.27 | LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions (arXiv), (PDF), (arXiv-vanity), ([GitHub](https://github.com/m) |
4.27 | DataComp: In search of the next generation of multimodal datasets (arXiv), (PDF), (arXiv-vanity), (GitHub), (Project page) |
4.27 | We're Afraid Language Models Aren't Modeling Ambiguity (arXiv), (PDF), (arXiv-vanity) |
4.27 | Boston Dynamics robot dog can answer your questions now, thanks to ChatGPT - (ZDNet), (YouTube) |
4.27 | LlamaIndex & Deep Lake for Financial Statement Analysis (Blog) |
4.26 | Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning (arXiv), (PDF), (arXiv-vanity) |
4.26 | Multidimensional Evaluation for Text Style Transfer Using ChatGPT (arXiv), (PDF), (arXiv-vanity) |
4.26 | NPJ - Comparing scientific abstracts generated by ChatGPT to real abstracts with detectors and blinded human reviewers (Paper), (PDF) |
4.26 | TopGPT — the world’s first Andrew Tate large language model |
4.26 | Multi-Party Chat: Conversational Agents in Group Settings with Humans and Models (arXiv), (PDF), (arXiv-vanity) |
4.26 | MOSS, a 16B tool-augmented conversational language model (Tweet), (GitHub) |
4.26 | Exploring the Curious Case of Code Prompts (arXiv), (PDF), (arXiv-vanity) |
4.26 | Controllable Image Generation via Collage Representations (arXiv), (PDF), (arXiv-vanity) |
4.26 | Unleashing Infinite-Length Input Capacity for Large-scale Language Models with Self-Controlled Memory System (arXiv), (PDF), (arXiv-vanity) |
4.26 | TextDeformer: Geometry Manipulation using Text Guidance (arXiv), (PDF), (arXiv-vanity) |
4.26 | Evaluation of GPT-3.5 and GPT-4 for supporting real-world information needs in healthcare delivery (arXiv), (PDF), (arXiv-vanity) |
4.26 | Ray Conditioning: Trading Photo-consistency for Photo-realism in Multi-view Image Generation (arXiv), (PDF), (arXiv-vanity), (Project page) |
4.26 | Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.26 | HuggingChat - the first open source alternative to ChatGPT |
4.25 | Time - The 'Don't Look Up' Thinking That Could Doom Us With AI (Archive) |
4.25 | AI-assisted coding: Experiments with GPT-4 (arXiv), (PDF), (arXiv-vanity) |
4.25 | NVIDIA NeMo Guardrails helps enterprises keep applications built on large language models aligned with their safety and security requirements (Blog), (GitHub) |
4.25 | Stable and low-precision training for large-scale vision-language models (arXiv), (PDF), (arXiv-vanity) |
4.25 | AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head (arXiv), (PDF), (arXiv-vanity) |
4.25 | Answering Questions by Meta-Reasoning over Multiple Chains of Thought (arXiv), (PDF), (arXiv-vanity) |
4.25 | Patch-based 3D Natural Scene Generation from a Single Example (arXiv), (PDF), (arXiv-vanity), (Project page) |
4.25 | Generative AI at Work - (NBER), (PDF) |
4.25 | Chatbot Arena |
4.24 | Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model (arXiv), (PDF), (arXiv-vanity),(Project page), (GitHub) |
4.24 | AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays (arXiv), (PDF), (arXiv-vanity) |
4.24 | Pointersect: Neural Rendering with Cloud-Ray Intersection (arXiv), (PDF), (arXiv-vanity), (web) |
4.24 | A Cookbook of Self-Supervised Learning (arXiv), (PDF), (arXiv-vanity) |
4.24 | On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.24 | Towards Realistic Generative 3D Face Models (arXiv), (PDF), (arXiv-vanity) |
4.24 | TextMesh: Generation of Realistic 3D Meshes From Text Prompts (arXiv), (PDF), (arXiv-vanity) |
4.24 | Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training Exam (TXIT): Potentials and Challenges for AI-Assisted Medical Education and Decision Making in Radiation Oncology (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.24 | Social AGI - SAMANTHA (Self-Reflective Artificial Mind Attuned to Naturalistic Thought and Human Adaptability) (GitHub) |
4.24 | Segment Anything in Medical Images (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.24 | Segment Anything in 3D with NeRFs (arXiv), (PDF), (arXiv-vanity), (project page) |
4.24 | WizardLM: Empowering Large Language Models to Follow Complex Instructions (arXiv), (PDF), (arXiv-vanity) |
4.24 | Track Anything: Segment Anything Meets Videos (arXiv), (PDF), (arXiv-vanity) |
4.24 | OpenAI Brand guidelines - (blog) |
4.24 | GPT4Tools: Teaching LLM to Use Tools via Self-instruction - (Project page), (Github), (Video), |
4.24 | RAM: Relate-Anything-Model (GitHub), (Demo) |
4.24 | Chart-GPT 1.0 |
4.23 | Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.23 | Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness (arXiv), (PDF), (arXiv-vanity) |
4.22 | Boosting Theory-of-Mind Performance in Large Language Models via Prompting (arXiv), (PDF), (arXiv-vanity) |
4.22 | LaMP: When Large Language Models Meet Personalization (arXiv), (PDF), (arXiv-vanity), (Project page), (Download), (Leaderboard), (GitHub) |
4.22 | Finetuning Large Language Models (Blog) |
4.21 | Can GPT-4 Perform Neural Architecture Search? (arXiv), (PDF), (arXiv-vanity) |
4.21 | Evaluating Transformer Language Models on Arithmetic Operations Using Number Decomposition (arXiv), (PDF), (arXiv-vanity) |
4.21 | Emergent and Predictable Memorization in Large Language Models (arXiv), (PDF), (arXiv-vanity) |
4.21 | CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval (arXiv), (PDF), (arXiv-vanity) |
4.21 | Bard now helps you code with support for 20+ langs (Python, C++, JS, Go, etc.). (Blog) |
4.21 | Inducing anxiety in large language models increases exploration and bias (arXiv), (PDF), (arXiv-vanity) |
4.20 | Why Does ChatGPT Fall Short in Answering Questions Faithfully? (arXiv), (PDF), (arXiv-vanity) |
4.20 | FinChat.io - The Chat GPT for Finance |
4.20 | LlamaAcademy: Teaching Llamas How to Code (GitHub) |
4.20 | Announcing Google DeepMind: DeepMind + Brain = Google DeepMind (Blog) |
4.20 | "Can ChatGPT Diagnose Me?" How Large Language Models will Transform Clinical Care. Thursday, April 27th, 2023 (RSVP) |
4.20 | StableLM: Stability AI Language Models (GitHub), (Blog) |
4.19 | Fundamental Limitations of Alignment in Large Language Models (arXiv), (PDF), (arXiv-vanity) |
4.19 | Scaling Transformer to 1M tokens and beyond with RMT (arXiv), (PDF), (arXiv-vanity), (Github) |
4.19 | Occupational Heterogeneity in Exposure to Generative AI - (paper), (PDF) |
4.19 | The Unintended Consequences of Censoring Digital Technology -- Evidence from Italy's ChatGPT Ban (arXiv), (PDF), (arXiv-vanity) |
4.19 | CompressGPT: Decrease Token Usage by ~70% (blog) |
4.19 | Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes (arXiv), (PDF), (arXiv-vanity), (Github) |
4.19 | LLM as A Robotic Brain: Unifying Egocentric Memory and Control (arXiv), (PDF), (arXiv-vanity) |
4.19 | Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agent (arXiv), (PDF), (arXiv-vanity) |
4.19 | Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models (arXiv), (PDF), (arXiv-vanity), (project page), (GitHub) |
4.19 | h2oai's LLM repositories - (h2ogpt), (h2o-llmstudio), (Huggingface) |
4.19 | Evaluating Verifiability in Generative Search Engines (arXiv), (PDF), (arXiv-vanity) |
4.19 | How to train your own Large Language Models (Blog) |
4.19 | AI Playground from Vercel Labs (tweet) |
4.19 | StanfordBDHG HealthGPT (tweet), (GitHub) |
4.19 | GPT4All-J : the first Apache-2 Licensed Chatbot that runs locally on your machine (GitHub), (PDF) |
4.19 | PersonalPrivate.AI - system to advise on new patent ideas (tweet) |
4.18 | Economist - The world needs an international agency for artificial intelligence, say two AI experts (Archive) |
4.18 | CancerGPT: Few-shot Drug Pair Synergy Prediction using Large Pre-trained Language Models (arXiv), (PDF), (arXiv-vanity) |
4.18 | Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions (arXiv), (PDF), (arXiv-vanity) |
4.18 | Nature - Why open-source generative AI models are an ethical way forward for science |
4.18 | Autonomous Agents(BabyAGI, AutoGPT) & Agent Simulations(CAMEL, Generative Agents) (Blog) |
4.18 | AutoTaskFormer: Searching Vision Transformers for Multi-task Learning (arXiv), (PDF), (arXiv-vanity) |
4.18 | SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, and More (arXiv), (PDF), (arXiv-vanity), (Project page) |
4.18 | Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models (arXiv), (PDF), (arXiv-vanity), (Project page) |
4.18 | Google - Differentially private heatmaps (Blog) |
4.18 | The Complete Beginners Guide To Autonomous Agents |
4.18 | Llama Lab - A repo dedicated to building cutting-edge AGI projects: llama_agi (inspired by babyagi) and auto_llama (inspired by autogpt) (GitHub), (Llama Hub) |
4.18 | Elon Musk to start ChatGPT rival called “TruthGPT” (tweet) |
4.17 | MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code), (GitHub) |
4.17 | Notice of the Cyberspace Administration of China on Public Comments on the "Administrative Measures for Generative Artificial Intelligence Services (Draft for Comment)" (Announcement) |
4.17 | Pretrained Language Models as Visual Planners for Human Assistance (arXiv), (PDF), (arXiv-vanity) |
4.17 | An Evaluation on Large Language Model Outputs: Discourse and Memorization (arXiv), (PDF), (arXiv-vanity) |
4.17 | Epic, Microsoft bring generative AI to EHRs - ([Microsoft announcement](Microsoft and Epic expand strategic collaboration with integration of Azure OpenAI Service)) |
4.17 | BenchMD: A Benchmark for Modality-Agnostic Learning on Medical Images and Sensors (arXiv), (PDF), (arXiv-vanity) |
4.17 | Towards Robust Prompts on Vision-Language Models (arXiv), (PDF), (arXiv-vanity) |
4.17 | Tool Learning with Foundation Models (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.17 | Low-code LLM: Visual Programming over LLMs (arXiv), (PDF), (arXiv-vanity) |
4.17 | Wired - OpenAI’s CEO Says the Age of Giant AI Models Is Already Over |
4.17 | Synthetic Data from Diffusion Models Improves ImageNet Classification (arXiv), (PDF), (arXiv-vanity) |
4.17 | RedPajama-Data: An Open Source Recipe to Reproduce LLaMA training dataset (GitHib) |
4.17 | Visual Instruction Tuning (arXiv), (PDF), (arXiv-vanity), (GitHub), (Dataset), (Model), (Project page), (Demo) |
4.17 | Learning to Compress Prompts with Gist Tokens (arXiv), (PDF), (arXiv-vanity) |
4.17 | ImpressionGPT: An Iterative Optimizing Framework for Radiology Report Summarization with ChatGPT (arXiv), (PDF), (arXiv-vanity) |
4.17 | Meta - DINOv2: State-of-the-art computer vision models with self-supervised learning (blog), (GitHub), (Demo), (arXiv), (PDF), (arXiv-vanity) |
4.17 | TypingMind - A better UI for ChatGPT (tweet) |
4.16 | Understanding Large Language Models (Blog) |
4.16 | INSIGHT - an autonomous AI that can do medical research (GitHub) |
4.16 | GPT4free - use ChatGPT, for free!! - (GitHub) |
4.16 | Solving Math Word Problems by Combining Language Models With Symbolic Solvers (arXiv), (PDF), (arXiv-vanity) |
4.16 | ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human (arXiv), (PDF), (arXiv-vanity) |
4.16 | Driving and suppressing the human language network using large language models (bioRxiv), (PDF) |
4.16 | MultiGPT (GitHub). (tweet) |
4.16 | OpenAssistant Conversations - Democratizing Large Language Model Alignment (PDF), (YouTube), (hacker news) |
4.16 | Auto-evaluator - lightweight evaluation tool for question-answering using Langchain (GitHub) |
4.16 | NYT - Google Devising Radical Search Changes to Beat Back A.I. Rivals (Archive) |
4.15 | Brex's Prompt Engineering Guide (GitHub) |
4.15 | Graphologue and Sensecape by UCSD Creativity Lab |
4.15 | Tractable Control for Autoregressive Language Generation (arXiv), (PDF), (arXiv-vanity) |
4.15 | Web LLM - language model chats directly onto web browsers (Site), (GitHub) |
4.15 | MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models (Project page). (Paper), (GitHub), (YouTube) |
4.15 | OpenAssistant - The world's largest open-source replication of ChatGPT (site), (GitHub), (Dataset - OASST1), (Paper), (YouTube), (Reddit) |
4.14 | HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge (arXiv), (PDF), (arXiv-vanity), (🏆papers with code) |
4.14 | ChatGPT: Applications, Opportunities, and Threats (arXiv), (PDF), (arXiv-vanity) |
4.14 | Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding (arXiv), (PDF), (arXiv-vanity) |
4.14 | OpenBB Terminal V3.0.0rc2 - (GitHub) |
4.14 | Delta Denoising Score (arXiv), (PDF), (arXiv-vanity), (Project page) |
4.14 | DINOv2: Learning Robust Visual Features without Supervision (arXiv), (PDF), (arXiv-vanity) |
4.14 | Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.14 | WSJ - Elon Musk Creates New Artificial Intelligence Company X.AI (archive), (FT) |
4.14 | Google Med-PaLM 2 - A responsible path to generative AI in healthcare |
4.14 | Meta's open source Animated Drawings - (Blog) |
4.14 | ControlNet v1.1 nightly - (GitHub) |
4.13 | Teenage-AGI (GitHub) |
4.13 | Boosted Prompt Ensembles for Large Language Models (arXiv), (PDF), (arXiv-vanity) |
4.13 | ChatGPT-4 Outperforms Experts and Crowd Workers in Annotating Political Twitter Messages with Zero-Shot Learning (arXiv), (PDF), (arXiv-vanity) |
4.13 | Soundini: Sound-Guided Diffusion for Natural Video Editing (arXiv), (PDF), (arXiv-vanity), (Project page) |
4.13 | Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.13 | Inpaint Anything: Segment Anything Meets Image Inpainting (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.13 | GoalGPT by Nando.ai |
4.13 | Power-seeking can be probable and predictive for trained agents (arXiv), (PDF), (arXiv-vanity) |
4.13 | GoalGPT by Nando.ai |
4.13 | Stable Diffusion XL Beta Available for API Customers and DreamStudio Users |
4.13 | NAB 2023: Introducing Text-Based Editing in Premiere Pro, Properties panel in After Effects, and much more |
4.13 | Announcing New Tools for Building with Generative AI on AWS - Amazon LLM (Titan), AWS fine-tuning model (Bedrock), Amazon copilot competitor (Code whisperer) |
4.13 | FT - We must slow down the race to God-like AI (archive) |
4.13 | Segment Everything Everywhere All at Once (arXiv), (PDF), (arXiv-vanity) |
4.13 | Expressive Text-to-Image Generation with Rich Text (arXiv), (PDF), (arXiv-vanity), (Project page) |
4.13 | AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.12 | Can Large Language Models Transform Computational Social Science? (arXiv), (PDF), (arXiv-vanity) |
4.12 | Galactic ChitChat: Using Large Language Models to Converse with Astronomy Literature (arXiv), (PDF), (arXiv-vanity) |
4.12 | Performance of ChatGPT, GPT-4, and Google Bard on a Neurosurgery Oral Boards Preparation Question Bank (medRxiv), (PDF) |
4.12 | ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning (arXiv), (PDF), (arXiv-vanity) |
4.12 | Nature -Foundation models for generalist medical artificial intelligence (PDF) |
4.12 | Dolly v2 - 12B parameter language model (Model weight), (GitHub), (Blog) |
4.11 | Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond (arXiv), (PDF), (arXiv-vanity), (Project page), (GitHub), (Colab), (Hugging face) |
4.11 | Toxicity in ChatGPT: Analyzing Persona-assigned Language Models (arXiv), (PDF), (arXiv-vanity) |
4.11 | Multi-step Jailbreaking Privacy Attacks on ChatGPT (arXiv), (PDF), (arXiv-vanity) |
4.11 | Building LLM applications for production |
4.11 | Emergent autonomous scientific research capabilities of large language models (arXiv), (PDF), (arXiv-vanity) |
4.11 | OpenAI’s Bug Bounty Program |
4.11 | NTIA’s “AI Accountability Policy Request for Comment” |
4.11 | WSJ - Biden Administration Weighs Possible Rules for AI Tools Like ChatGPT, (archive) |
4.11 | ChemCrow: Augmenting large-language models with chemistry tools (arXiv), (PDF), (arXiv-vanity) |
4.11 | LangChainJS Support for Multiple JS Environments (tweet) |
4.11 | Teaching Large Language Models to Self-Debug (arXiv), (PDF), (arXiv-vanity) |
4.10 | Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models (Paper), (PDF) |
4.10 | On the Possibilities of AI-Generated Text Detection (arXiv), (PDF), (arXiv-vanity) |
4.10 | OpenAGI: When LLM Meets Domain Experts (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.9 | ChatAll - oncurrently chat with ChatGPT, Bing Chat, bard, Alpaca, Vincuna, Claude, ChatGLM, MOSS, iFlytek Spark, ERNIE and more, discover the best answers (GitHub) |
4.9 | BabyAGI JS - (GitHub) |
4.9 | AgentGPT - Auto-GPT directly in the browser (tweet), (GitHub), (demo) |
4.8 | A Recipe for Training Large Models |
4.7 | SuperPrompt Engineer Encourages ChatGPT Hallucinations |
4.7 | Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster (arXiv), (PDF), (arXiv-vanity) |
4.7 | Why think step-by-step? Reasoning emerges from the locality of experience (arXiv), (PDF), (arXiv-vanity) |
4.7 | Generative Agents: Interactive Simulacra of Human Behavior (arXiv), (PDF), (arXiv-vanity), (Project) |
4.7 | Vicuna-7B: small, efficient, yet capable (GitHub), (Weight) |
4.7 | StackLlama (Blog), (Demo), (GitHub) |
4.7 | SegGPT: Segmenting Everything In Context (arXiv), (PDF), (arXiv-vanity), (GitHub), (Demo) |
4.6 | Chrome ships WebGPU (Blog) |
4.6 | GPT detectors are biased against non-native English writers (arXiv), (PDF), (arXiv-vanity) |
4.6 | ChaosGPT: Empowering GPT with Internet and Memory to Destroy Humanity (YouTube) |
4.6 | InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning (arXiv), (PDF), (arXiv-vanity), (Project) |
4.6 | Wired - AI Desperately Needs Global Oversight |
4.6 | Instruction Tuning with GPT-4 (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.6 | GeNVS: Generative Novel View Synthesis with 3D-Aware Diffusion Models (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.6 | Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark (arXiv), (PDF), (arXiv-vanity) |
4.5 | Yoshua Bengio - Slowing down development of AI systems passing the Turing test |
4.5 | Language models are on Replicate - FLAN-T5, GPT-J, and LLaMA (Blog) |
4.5 | Meta's Segment Anything Model (SAM) (Paper), (PDF), (GitHub), (Demo), (arXiv), (PDF), (arXiv-vanity) |
4.4 | Calibrated Chaos: Variance Between Runs of Neural Network Training is Harmless and Inevitable (arXiv), (PDF), (arXiv-vanity) |
4.4 | One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era (arXiv), (PDF), (arXiv-vanity) |
4.4 | LangCahin raised $10 million in seed funding |
4.4 | Kandinsky 2.1 (GitHub), (HuggingFace) |
4.4 | The weights of Vicuna-13B released (WebUI demo) (GitHub) |
4.4 | LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.4 | Summary of ChatGPT/GPT-4 Research and Perspective Towards the Future of Large Language Models (arXiv), (PDF), (arXiv-vanity) |
4.3 | Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling (arXiv), (PDF), (arXiv-vanity) |
4.3 | Vicuna-13B: An Open-Source ChatGPT Alternative That Impresses GPT-4 (Blog), (GitHub) |
4.3 | Baby AGI (GitHub) |
4.3 | Berkley just released Koala-13B! (Demo) |
4.3 | 2023 Artificial Intelligence (AI) Index Report Published by Stanford Institute for Human-Centered Artificial Intelligence (HAI) |
4.3 | The LLM playground - open source (Github) |
4.3 | Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data (arXiv), (PDF), (arXiv-vanity), (GitHub) |
4.2 | GPTCache : A Library for Creating Semantic Cache for LLM Queries - (GitHub) |
4.2 | Better Language Models of Code through Self-Improvement (arXiv), (PDF), (arXiv-vanity) |
4.2 | Eight Things to Know about Large Language Models (arXiv), (PDF), (arXiv-vanity) |
4.2 | LLMMaps -- A Visual Metaphor for Stratified Evaluation of Large Language Models (arXiv), (PDF), (arXiv-vanity) |
4.1 | Italy curbs ChatGPT, starts probe over privacy concerns |
3.31 | Choose Your Weapon: Survival Strategies for Depressed AI Academics (arXiv), (PDF), (arXiv-vanity) |
3.31 | CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society (arXiv), (PDF), (arXiv-vanity), (GitHub) |
3.31 | A Survey of Large Language Models - Version 1 (arXiv), (PDF), (arXiv-vanity) |
3.31 | (SCIENTIFIC AMERICAN) AI Chatbots Can Diagnose Medical Conditions at Home. How Good Are They? |
3.30 | ChatGPT in Healthcare: A Taxonomy and Systematic Review (medRxiv), (PDF) |
3.30 | Launching the Generative AI Open Source (GenOS) Index - (Index), (Tweet) |
3.30 | Whose Opinions Do Language Models Reflect? (arXiv), (PDF), (arXiv-vanity), (GitHub) |
3.30 | Language Models can Solve Computer Tasks (arXiv), (PDF), (arXiv-vanity) |
3.30 | Self-Refine: Iterative Refinement with Self-Feedback (arXiv), (PDF), (arXiv-vanity) |
3.30 | Humans in Humans Out: On GPT Converging Toward Common Sense in both Success and Failure (arXiv), (PDF), (arXiv-vanity) |
3.30 | List of Open Sourced Fine-Tuned Large Language Models (LLM) |
3.30 | NEJM - Benefits, Limits, and Risks of GPT-4 as an AI Chatbot for Medicine |
3.30 | BloombergGPT: A Large Language Model for Finance (arXiv), (PDF), (arXiv-vanity) |
3.30 | Got It AI’s ELMAR challenges GPT-4 and LLaMa, scores well on hallucination benchmarks |
3.30 | HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace (arXiv), (PDF), (arXiv-vanity) |
3.30 | CAIDP claims "The FTC should investigate OpenAI and block GPT over ‘deceptive’ behavior" |
3.30 | Epic to use Microsoft's GPT-4 in EHRs |
3.30 | Auto-GPT: An Autonomous GPT-4 Experiment (GitHub) |
3.29 | AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators (arXiv), (PDF), (arXiv-vanity) |
3.29 | nucleotide transformers - genomics LLM, ranging from 500M to 2.5B parameters - (GitHub) |
3.29 | GeoV-9b - 9 billion parameter causal language model (code, weights, colab) |
3.29 | GPT4All - 7B param language model finetuned from a curated set of 400k GPT-Turbo-3.5 |
3.29 | LLaMA-Adapter!: Efficient Fine-tuning of Language Models with Zero-init Attention |
3.29 | MacGPT 3.2 |
3.29 | GPTEval: NLG Evaluation using GPT-4 with Better Human Alignment (arXiv), (PDF), (arXiv-vanity) |
3.29 | TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs (arXiv), (PDF), (arXiv-vanity) |
3.28 | Natural Selection Favors AIs over Humans arXiv), (PDF), (arXiv-vanity) |
3.28 | ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks (arXiv), (PDF), (arXiv-vanity) |
3.28 | LLaMA voice chat + Siri TTS |
3.28 | Cerebras-GPT - 111M to 13B parameters trained using the Chinchilla formula |
3.28 | Microsoft Security Copilot: Empowering defenders at the speed of AI |
3.28 | Google pix2struct launched today, a multimodal model specializing in screenshot data |
3.28 | OpenFlamingo - a framework that enables training and evaluation of large multimodal models (LMMs) |
3.27 | Microsoft JARVIS (GitHub) |
3.27 | ChatGPT Survey: Performance on NLP datasets |
3.27 | GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models (arXiv), (PDF), (arXiv-vanity) |
3.26 | Nature Language Reasoning, A Survey (arXiv), (PDF), (arXiv-vanity) |
3.26 | Sam Altman: OpenAI CEO on GPT-4, ChatGPT, and the Future of AI - Lex Fridman Podcast #367 |
3.26 | LLaMA voice chat |
3.26 | Japanese Alpaca LoRA |
3.24 | Efficient Methods for Natural Language Processing: A Survey (arXiv), (PDF), (arXiv-vanity) |
3.24 | NYT OPINION - You Can Have the Blue Pill or the Red Pill, and We’re Out of Blue Pills (archive) |
3.24 | Dolly - open source LLM |
3.24 | Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators |
3.24 | ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge (arXiv), (PDF), (arXiv-vanity), (GitHub) |
3.24 | Do large language models need sensory grounding for meaning and understanding? @YannLeCun |
3.23 | OpenAI: ChatGPT Plugins |
3.23 | Opera brings AI ChatGPT bot sidebar to browsers |
3.22 | Artificial muses: Generative Artificial Intelligence Chatbots Have Risen to Human-Level Creativity (arXiv), (PDF), (arXiv-vanity), (paper page), (papers with code) |
3.22 | GitHub: Copilot X |
3.22 | Sparks of Artificial General Intelligence: Early experiments with GPT-4 (arXiv), (PDF), (arXiv-vanity), (YouTube) |
3.22 | Pause Giant AI Experiments: An Open Letter |
3.21 | WSJ - Generative AI Makes Headway in Healthcare |
3.21 | NVIDIA Brings Generative AI to World’s Enterprises |
3.21 | Adobe launches Firefly |
3.21 | Google launches Bard in the US and UK |
3.21 | Microsoft: Bing Image Creator |
3.21 | Stability AI Launches Stable Diffusion Reimagine |
3.20 | Reflexion: an autonomous agent with dynamic memory and self-reflection (arXiv), (PDF), (arXiv-vanity), (GitHub) |
3.20 | March 20 ChatGPT outage: Here’s what happened |
3.20 | Runway Gen-2 |
3.20 | Paper: Capabilities of GPT-4 on Medical Challenge Problems |
3.20 | Making Music with GPT 4 by (Wavtool) |
3.19 | Simple LLM Finetuner (GitHub) |
3.18 | Data-centric Artificial Intelligence: A Survey (arXiv), (PDF), (arXiv-vanity), (GitHub) |
3.17 | Can AI-Generated Text be Reliably Detected? (arXiv), (PDF), (arXiv-vanity) |
3.17 | GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models (arXiv), (PDF), (arXiv-vanity) |
3.16 | WebSHAP: Towards Explaining Any Machine Learning Models Anywhere (arXiv), (PDF), (arXiv-vanity), (GitHub) |
3.16 | LERF: Language Embedded Radiance Fields (arXiv), (PDF), (arXiv-vanity), (GitHub) |
3.16 | Microsoft: Microsoft 365 Copilot |
3.16 | Alpaca LoRA: instruct tune LLAMA on consumer hardware |
3.16 | OpenAI CEO Sam Altman says AI will reshape society, acknowledges risks: 'A little bit scared of this' |
3.15 | A new era for AI and Google Workspace |
3.15 | PyTorch 2.0: Our next generation release |
3.15 | Baidu: ERNIE Bot |
3.15 | Midjourney: Midjourney V5 |
3.15 | arXiv - GPT-4 Technical report |
3.14 | The Lancet - Attention is not all you need: the complicated case of ethically using large language models in healthcare and medicine |
3.14 | THUDM releases ChatGLM-6B |
3.14 | Anthropic: Claude |
3.14 | Google: PaLM API & Workspace |
3.14 | OpenAI: GPT-4 |
3.13 | Stanford Alpaca 7B |
3.13 | Microsoft lays off team that taught employees how to make AI tools responsibly |
3.13 | MiniLLM: Large Language Models on Consumer GPUs |
3.13 | Chatbot UI (Github) |
3.12 | GM explores using ChatGPT in vehicles |
3.10 | Google: PaLM-E |
3.9 | multi-model playground - https://nat.dev |
3.9 | GPT-4 is coming next week |
3.8 | Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models (arXiv), (PDF), (arXiv-vanity) |
3.8 | NYT, Opinion - Noam Chomsky: The False Promise of ChatGPT (archive) |
3.7 | A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT (arXiv), (PDF), (arXiv-vanity) |
3.7 | Radiology - The Role and Limitations of Large Language Models Such as ChatGPT in Clinical Settings and Medical Journalism |
3.7 | Stability AI Acquires Image Editing App Clipdrop |
3.6 | Google: Universal Speech Model |
3.5 | Generative AI: Perspectives from Stanford HAI |
3.5 | UpStage, ChatGPT bot (Askup) on Line |
3.5 | UpStage, ChatGPT bot (Askup) on KakaoTalk |
3.2 | Consistency Models (arXiv), (PDF), (arXiv-vanity), (GitHub) |
3.1 | OpenAI: ChatGPT and Whisper API |
2.28 | Large Language Models Are State-of-the-Art Evaluators of Translation Quality (arXiv), (PDF), (arXiv-vanity) |
2.27 | Best Practices for Using AI When Writing Scientific Manuscripts (ACS Nano 2023, 17, 5, 4091–4093) |
2.27 | Fighting ‘Woke AI,’ Musk Recruits Team to Develop OpenAI Rival |
2.25 | The Lancet - The promise of large language models in health care |
2.25 | AugGPT: Leveraging ChatGPT for Text Data Augmentation (arXiv), (PDF), (arXiv-vanity) |
2.24 | Sam Altman, Planning for AGI and beyond |
2.24 | Meta: LLaMA |
2.23 | Radiology - ChatGPT and the Future of Medical Writing |
2.23 | Instagram co-founders launch AI-powered news app Artifact on Android, iOS |
2.23 | Notion.AI launch |
2.22 | The alignment problem from a deep learning perspective (arXiv), (PDF), (arXiv-vanity) |
2.22 | Microsoft: Bing announcement on mobile and Skype |
2.22 | Science - As scientists explore AI-written text, journals hammer out policies |
2.21 | BadGPT: Exploring Security Vulnerabilities of ChatGPT via Backdoor Attacks to InstructGPT (arXiv), (PDF), (arXiv-vanity) |
2.21 | Hyena Hierarchy: Towards Larger Convolutional Language Models (arXiv), (PDF), (arXiv-vanity) |
2.21 | The PNAS Journals Outline Their Policies for ChatGPT and Generative AI |
2.21 | ChatGPT: Jack of all trades, master of none (arXiv), (PDF), (arXiv-vanity) |
2.17 | Time, ChatGPT cover |
2.17 | OpenAI, Foundry Product Brief |
2.17 | Generative AI on Roblox: Our Vision for the Future of Creation |
2.16 | Do We Still Need Clinical Language Models? (arXiv), (PDF), (arXiv-vanity) |
2.16 | Startup Replit launches a ChatGPT-like bot for coders |
2.15 | A&O announces exclusive launch partnership with Harvey |
2.14 | What Is ChatGPT Doing … and Why Does It Work? (Stephen Wolfram Writings) |
2.14 | 1M ChatGPT plus user |
2.14 | The Gen AI Conference Hosted by Jasper |
2.13 | Google: Vision Transformer 22B |
2.12 | Transformer models: an introduction and catalog (arXiv), (PDF), (arXiv-vanity), (Blog) |
2.10 | arXivGPT launches |
2.10 | OpenAI, ChatGPT plus announce (20$) |
2.9 | Disastrous Chatbot Demo Costs Google $140 Billion |
2.9 | Meta: Toolformer |
2.8 | A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity (arXiv), (PDF), (arXiv-vanity) |
2.8 | Runway launches ground-breaking Gen-1 video generation AI system |
2.7 | Microsoft: Bing ChatGPT |
2.7 | Getty Images sues AI art generator Stable Diffusion in the US for copyright infringement |
2.6 | The Lancet - ChatGPT: friend or foe? |
2.6 | Google: Bard announcement |
2.4 | Theory of Mind May Have Spontaneously Emerged in Large Language Models (arXiv), (PDF), (arXiv-vanity) |
2.4 | POE.com open |
2.3 | Google invests in Anthropic, maker of ChatGPT rival |
2.3 | Naver, SearchGPT announcement |
2.2 | Creating a Large Language Model of a Philosopher (arXiv), (PDF), (arXiv-vanity) |
2.2 | ChatGPT reaches 100 million users two months after launch |
2.1 | The Diagnostic and Triage Accuracy of the GPT-3 Artificial Intelligence Model (medrXiv |
2.1 | OpenAI, released a software tool to help identify text generated by AI |
1.31 | JAMA Network - Nonhuman “Authors” and Implications for the Integrity of Scientific Publication and Medical Knowledge |
1.30 | SingSong: Generating musical accompaniments from singing (arXiv), (PDF), (arXiv-vanity), (GitHub) |
1.30 | China's biggest search engine is to set launch a ChatGPT rival in March |
1.26 | Science Journal - ChatGPT is fun, but not an author |
1.26 | DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature (arXiv), (PDF), (arXiv-vanity) |
1.26 | ChatGPT Is Coming for Classrooms. Don't Panic |
1.26 | ChatGPT passes exams from law and business schools |
1.26 | Google’s new AI turns text into music - MusicLM |
1.24 | Putting ChatGPT's Medical Advice to the (Turing) Test (arXiv), (PDF), (arXiv-vanity) |
1.24 | Nature policy - Tools such as ChatGPT threaten transparent science; here are our ground rules for their use |
1.20 | WAME policy - Chatbots, ChatGPT, and Scholarly Manuscripts |
1.17 | Meet Claude: Anthropic’s Rival to ChatGPT |
1.14 | Microsoft in talks to acquire a 49% stake in ChatGPT owner OpenAI |
1.12 | Multimodal Deep Learning (arXiv), (PDF), (arXiv-vanity) |
1.11 | This Voice Doesn't Exist - Generative Voice AI |
1.9 | Microsoft is looking at OpenAI’s GPT for Word, Outlook, and PowerPoint |
1.5 | Apple launches AI-powered book narrations |
1.5 | Microsoft, VALL-E |
1.4 | ICML conference responds to LLM ethics rule |
1.3 | Enter GPTZeo |
2023.01.01 | Collected by Jonghong Jeon (hollobit@etri.re.kr) |
12.29 | GPT Takes the Bar Exam (arXiv), (PDF), (arXiv-vanity) |
12.27 | bioarXiv - Comparing scientific abstracts generated by ChatGPT to original abstracts using an artificial intelligence output detector, plagiarism detector, and blinded human reviewers |
11.30 | OpenAI, ChatGPT service |
11.28 | NeurIPS 2022 conference |
11.17 | InstructPix2Pix: Learning to Follow Image Editing Instructions |
11.16 | Holistic Evaluation of Language Models (arXiv), (PDF), (arXiv-vanity) |
10.30 | LlamaIndex (GPT Index) GitHub project |
10.23 | LangChain GitHub project |
9.19 | SEQUOIA - Generative AI: A Creative New World |
8.25 | Understanding Diffusion Models: A Unified Perspective (arXiv), (PDF), (arXiv-vanity), (Blog) |
3.29 | Training Compute-Optimal Large Language Models (arXiv), (PDF), (arXiv-vanity), (paper page) |
3.15 | OpenAI, GPT 3.5 announce |
2.11 | Compute Trends Across Three Eras of Machine Learning (arXiv), (PDF), (arXiv-vanity) |
2022.01.01 | |
8.16 | On the Opportunities and Risks of Foundation Models (arXiv), (PDF), (arXiv-vanity) |
4.18 | The Power of Scale for Parameter-Efficient Prompt Tuning (arXiv), (PDF), (arXiv-vanity) |
2021.01.01 | |
Last Modified 2023/04/14 PM19:40 KST |
- Open LLM Leaderboard
- AI Incident Database
- Daily papers by AK
- Awesome-Generative-RecSys - A curated list of Generative Recommender Systems (Paper & Code)
- Prompt Engineering Guide - papers - Github
- awesome-ChatGPT-repositories
- The Rundown
- WEEKLY PAPERS
- Primo.ai LLM wiki
- ML Papers of the Week
- CS 324 - Advances in Foundation Models
- ML timeline
- ChatGPT Timeline
- OpenAI Timeline
- The Rise and Rise of A.I. LLMs