/grok

Grok open release

Primary LanguagePythonApache License 2.0Apache-2.0

Grok-1 Repository

This repository hosts the example code for loading and executing the Grok-1 model, which is available with open weights. The Grok-1 model is a state-of-the-art machine learning model designed for a wide range of applications.

Getting Started

To utilize the Grok-1 model, please follow these instructions:

  1. Download the Checkpoint: First, ensure you have downloaded the checkpoint. Place the ckpt-0 directory within the checkpoint folder.

  2. Install Dependencies: Install the required Python packages by running the following command in your terminal:

    pip install -r requirements.txt
  3. Execute the Model: You can run the model using the provided script. Execute the following command:

    python run.py

This script will load the checkpoint and sample from the model using a test input.

Requirements

Due to the extensive size of the Grok-1 model, which consists of 314 billion parameters, it is imperative to use a machine equipped with sufficient GPU memory to run the example code effectively.

It is important to note that the implementation of the Mixture of Experts (MoE) layer in this repository prioritizes simplicity over efficiency. This decision was made to eliminate the necessity for custom kernels, thereby facilitating the validation of the model's accuracy.

Downloading the Model Weights

The Grok-1 model weights can be acquired using a torrent client with the following magnet link:

magnet:?xt=urn:btih:5f96d43576e3d386c9ba65b883210a393b68210e&tr=https%3A%2F%2Facademictorrents.com%2Fannounce.php&tr=udp%3A%2F%2Ftracker.coppersurfer.tk%3A6969&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce

Contributors

Thanks goes to these wonderful contributors (emoji key following all-contributors specification):

ibab
ibab

💻 📖 🚇 🚧 🤔 👀 🔧 ⚠️ 🔬
TobyPDE
TobyPDE

💻 👀 🐛 🤔 🚧 ⚠️ 📖 🛡️

Contributions from those not on the research team are welcome.

License

The code and associated Grok-1 weights in this release are licensed under the Apache 2.0 license. The license only applies to the source files in this repository and the model weights of Grok-1.