lablup/backend.ai
Backend.AI is a streamlined, container-based computing cluster platform that hosts popular computing/ML frameworks and diverse programming languages, with pluggable heterogeneous accelerator support including CUDA GPU, ROCm GPU, TPU, IPU and other NPUs.
PythonLGPL-3.0
Pinned issues
Issues
- 0
Create a manager API to monitor DB connection pools and other relevant database metrics.
#2984 opened by kmkwon94 - 0
- 0
- 0
- 0
Limit the number of sessions (and their kernels) created at a single scheduler tick
#2952 opened by fregataa - 0
- 0
- 1
Add status filter to `endpoint_list`
#2948 opened by agatha197 - 1
Extend accelerator plugin architecture to allow container user to join extra Linux groups
#2850 opened by kyujin-cho - 0
- 0
Cannot update model service's image architecture
#2930 opened by kyujin-cho - 0
- 0
- 0
Support auto scaling on Model Service
#2659 opened by kyujin-cho - 1
Make sure to use node package manager as pnpm
#2435 opened by Yaminyam - 1
- 1
- 0
Introduce memcached and split the check-presets API
#2546 opened by achimnol - 1
- 0
Allow agent to have multiple backend instances
#2562 opened by achimnol - 0
Support hugepages allocation in containers
#2589 opened by achimnol - 0
- 0
Update the krunner-alpine distribution support package to use Alpine 3.17 (musl 1.2.3+)
#2638 opened by achimnol - 0
Support model service controls for superadmin
#2644 opened by agatha197 - 0
Support image's own entrypoint and published ports
#2672 opened by achimnol - 0
order and filter is unabled in endpointlist query
#2722 opened by ggstargame45 - 0
SFTP connection limits
#2747 opened by achimnol - 0
- 0
Storage resource group
#2791 opened by achimnol - 0
- 0
- 0
Silent failure of `DockerAgent.push_image()` due to absense of exception handling
#2569 opened by jopemachine - 0
- 1
- 0
There are two or more matching vfolders error
#2753 opened by gahyuun - 0
Wrong Implicit log level overridding when `--log-level` argument is not provided
#2759 opened by jopemachine - 0
- 0
- 0
- 1
Allow shell evaluation on model definition
#2909 opened by kyujin-cho - 0
- 0
Migrate to pydantic-based local configuration schema
#2764 opened by achimnol - 2
- 0
- 0
- 0
Deprecate all non-paginated list queries in GraphQL
#2560 opened by achimnol - 0
Reduce overall database and query loads
#2559 opened by achimnol - 0
SQLAlchemy query caching
#2547 opened by achimnol - 0
Add a column to the table.
#2513 opened by why-arong - 0
Prettify exceptions in the kernel runner
#2477 opened by achimnol