A comprehensive platform for tracking and comparing AI model performance across various benchmarks.
- Performance Matrix showing model scores across different benchmarks
- Detailed benchmark information pages
- Filtering by categories and model types
- Search functionality
- RESTful API for data access
- Responsive design for all devices
- Next.js 14
- TypeScript
- Tailwind CSS
- React Hooks for state management
- Express.js
- TypeScript
- Prisma ORM
- SQLite (development) / PostgreSQL (production)
ai-benchmark-platform/
├── frontend/ # Next.js frontend application
│ ├── src/
│ │ ├── app/ # Next.js app directory
│ │ ├── components/ # React components
│ │ └── styles/ # Global styles
│ ├── public/ # Static assets
│ └── package.json
│
├── backend/ # Express.js backend application
│ ├── src/
│ │ ├── routes/ # API routes
│ │ └── server.ts # Server entry point
│ ├── prisma/ # Database schema and migrations
│ └── package.json
- Node.js 18+
- npm or yarn
- Git
-
Clone the repository: ```bash git clone https://github.com/Belzedar94/ai-benchmark-platform.git cd ai-benchmark-platform ```
-
Install backend dependencies: ```bash cd backend npm install ```
-
Set up the database: ```bash npx prisma migrate dev npx prisma db seed ```
-
Install frontend dependencies: ```bash cd ../frontend npm install ```
-
Start the backend server: ```bash cd backend npm run dev ```
-
In a new terminal, start the frontend: ```bash cd frontend npm run dev ```
The application will be available at:
- Frontend: http://localhost:3000
- Backend API: http://localhost:3001
- `GET /api/benchmarks`: List all benchmarks
- `GET /api/benchmarks/:id`: Get benchmark details
- `GET /api/models`: List all models
- `GET /api/categories`: List all categories
- `GET /api/search`: Search benchmarks and models
```json { "benchmarks": [ { "id": 1, "name": "GLUE", "description": "General Language Understanding Evaluation benchmark", "category": { "id": 1, "name": "Natural Language Processing" }, "scores": [ { "score": 89.3, "model": { "name": "GPT-3" } } ] } ] } ```
- Use TypeScript for type safety
- Follow ESLint configuration
- Use Prettier for code formatting
- `master`: Production-ready code
- `develop`: Development branch
- Feature branches: `feature/feature-name`
Run tests: ```bash npm test ```
- Connect your Vercel account
- Configure environment variables
- Deploy using Vercel CLI or GitHub integration
- Set up PostgreSQL database
- Configure environment variables
- Deploy to your preferred hosting service
- Fork the repository
- Create a feature branch
- Commit your changes
- Push to the branch
- Create a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- ✅ Core platform functionality
- ✅ Performance matrix
- ✅ Basic filtering and search
- 🔄 Testing implementation
- 🔄 Deployment setup
- User authentication
- Enhanced visualizations
- Community features
- Performance optimizations
- Advanced analytics
For questions or feedback, please open an issue in the GitHub repository.