MoFHeka
Github suspend my old account, so this is my new account. My Gitee url is https://gitee.com/MoFHeka.
Kanzhun.incBeijing
Pinned Repositories
AccurateHeartX
AsterHiredis
A seastar implement for redis client.
execution-ucx
A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.
LLaMA-Alpa
A LLaMa pretrain code by using Alpa(https://github.com/alpa-projects/alpa).
LLaMA-Megatron
A LLaMA1/LLaMA12 Megatron implement.
MeepoEmbedding
A distributed high-performance dynamic lookuptable-style Embedding designed for recommendation, search, CTR and advertising systems. Supports GPU, CPU, remote distributed KV (such as Redis), SSD, and other backends.
Megatron-AutoCkpt
A Megatron checkpoint auto-saving patch at the end of each iteration, inspired by Alibaba PAI EasyCkpt for Megatron.
recommenders-addons
Additional utils and helpers to extend TensorFlow when build recommendation systems, contributed and maintained by SIG Recommenders.
xla-launcher
XLA Launcher is a high-performance, lightweight C++ library designed to provide a simple interface for loading and executing computation graphs represented in the StableHLO format.
recommenders-addons
Additional utils and helpers to extend TensorFlow when build recommendation systems, contributed and maintained by SIG Recommenders.
MoFHeka's Repositories
MoFHeka/LLaMA-Megatron
A LLaMA1/LLaMA12 Megatron implement.
MoFHeka/execution-ucx
A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.
MoFHeka/MeepoEmbedding
A distributed high-performance dynamic lookuptable-style Embedding designed for recommendation, search, CTR and advertising systems. Supports GPU, CPU, remote distributed KV (such as Redis), SSD, and other backends.
MoFHeka/xla-launcher
XLA Launcher is a high-performance, lightweight C++ library designed to provide a simple interface for loading and executing computation graphs represented in the StableHLO format.
MoFHeka/AsterHiredis
A seastar implement for redis client.
MoFHeka/LLaMA-Alpa
A LLaMa pretrain code by using Alpa(https://github.com/alpa-projects/alpa).
MoFHeka/Megatron-AutoCkpt
A Megatron checkpoint auto-saving patch at the end of each iteration, inspired by Alibaba PAI EasyCkpt for Megatron.
MoFHeka/recommenders-addons
Additional utils and helpers to extend TensorFlow when build recommendation systems, contributed and maintained by SIG Recommenders.
MoFHeka/AccurateHeartX
MoFHeka/bazel-central-registry
The central registry of Bazel modules for the Bzlmod external dependency system.
MoFHeka/clash-for-linux-backup
Linux最完整的Clash for Linux的备份仓库,完全可以使用!由Yizuko进行修复及维护
MoFHeka/deepray
Deepray for continuous integration development.
MoFHeka/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
MoFHeka/HierarchicalKV
HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of Merlin-KV is to store key-value feature-embeddings on high-bandwidth memory (HBM) of GPUs and in host memory. It also can be used as a generic key-value storage.
MoFHeka/Megatron-LM
Ongoing research training transformer models at scale
MoFHeka/NeMo
NeMo: a toolkit for conversational AI
MoFHeka/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.
MoFHeka/CL32Q0-CMSIS-DSP
MoFHeka/CL32Q0-debug-bridge
MoFHeka/CL32Q0-demo
MoFHeka/CL32Q0-driver-library
MoFHeka/CL32Q0-thirdparty
MoFHeka/CorelinkIDE
MoFHeka/GeoCompass
MoFHeka/interview-coder
An open-source invisible desktop application to help you pass your technical interviews.
MoFHeka/rules_nccl
MoFHeka/runtime
A performant and modular runtime for TensorFlow
MoFHeka/tensorflow-onnx
Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX
MoFHeka/unit-scaling
A library for unit scaling in PyTorch
MoFHeka/verl
verl: Volcano Engine Reinforcement Learning for LLMs