/Infosys-Transformer-Foundry

A LLM Ops platform that hosts pre curated set of open source models that can be fine tunned and deployed for our customers. Allows users to create their own data pipelines with DAG support and provides effective and optimized way to finetune and deploy models.

Primary LanguagePythonApache License 2.0Apache-2.0

Infosys Transformer Foundry

Overview

Infosys Transformer Foundry solution provides buildings blocks for managing LLM Ops and model life cycle management such as model selection, finetuning, benchmarking, deployment at scale along with data pipelines.

Features offered

Features v1.0

  • Model Zoo: Curated list of open source models along with their metadata, lifecycle status and model tagging.
  • Leaderboard: LLM leaderboard for customer data (text, embedding and code) on public/private models for efficient selection of models.
  • Benchmark tool: Allows benchmarking of fine-tuned or open source models.

Roadmap

  • Data Pipelines: Allow users to create custom data processing workflows for their models.
  • Fine Tuning: User can fine tune a model against custom datasets for tailored results.
  • Model Deployment: Facilitate deployment of curated or fine-tuned models and create access points.

Hardware & Software Requirements

Find the hardware and software requirements here

Installation

Docker

a. Clone the GitHub repository

git clone -b < branch and Repo url>

b. Build docker container for each component

Navigate to each component folder within the repository and build the corresponding Docker image using the following command:

cd <component_folder>
docker build -t ${DOCKER_REPO}/<image_name>

c. Run the Docker Container

docker-compose -f docker-compose.yaml up

Usage and Examples

Benchmark Evaluation

Reporting problems, asking questions

We appreciate your feedbacks, questions or bug reporting regarding this project. When posting issues in GitHub, ensure the posted examples follow the guidelines below:

Minimal: Provide the smallest possible code snippet that still reproduces the problem.

Complete: Include all necessary information (code, configuration, etc.) for someone else to replicate the issue.

Reproducible: Test your provided code to confirm it consistently reproduces the problem.