/go-simd

SIMD implementation in Go

Primary LanguageGoMIT LicenseMIT

SIMD Implementation in Golang

This repository contains an implementation of SIMD (Single Instruction, Multiple Data) operations in Go, specifically targeting ARM NEON architecture. The goal is to provide optimized parallel processing capabilities for certain computational tasks.

Future Plans

We are actively working on expanding the SIMD implementation to support x86 architecture as well. The upcoming x86 implementation will provide similar SIMD functionalities for parallel processing on x86-based systems.

Hacks

When we call a C function through CGO, there are some overheads due to Go design. In general, avoiding CGO would be a good idea. But I found a hack, instead of relying on CGO, we can utilize linkname directive to call C code, bypass CGO, and get better performance.

goos: darwin
goarch: arm64
pkg: github.com/alivanz/go-simd/arm/neon
BenchmarkMultRef-8                131395              9168 ns/op
BenchmarkMultSimd-8               598742              1954 ns/op
BenchmarkMultSimdBypass-8         605554              1959 ns/op
BenchmarkMultSimdFull-8          1816879               661.3 ns/op
BenchmarkMultSimdCgo-8             13020             92213 ns/op
PASS

Features

  • SIMD operations for ARM NEON architecture.
  • High-performance parallel processing for specific tasks.
  • Utilizes the power of SIMD instructions to process multiple data elements simultaneously.
  • Supports a range of data types, including integers and floating-point numbers.
  • Modular design for easy integration into existing projects.
  • Well-documented code for understanding and extending the implementation.

Roadmap

  • Implement SIMD operations for ARM NEON architecture.
  • Add support for x86 architecture.
  • Expand SIMD operations for additional data types.
  • Optimize performance for specific use cases.
  • Develop comprehensive test suite for validation.

Usage

To use the SIMD implementations in your project, follow these steps:

  1. Import the required package in your Go code:
import "github.com/alivanz/go-simd"
  1. Use the SIMD functions in your code as needed. Example:
package main

import (
	"log"

	"github.com/alivanz/go-simd/arm"
	"github.com/alivanz/go-simd/arm/neon"
)

func main() {
	var a, b arm.Int8X8
	var add, mul arm.Int16X8
	for i := 0; i < 8; i++ {
		a[i] = arm.Int8(i)
		b[i] = arm.Int8(i * i)
	}
	log.Printf("a = %+v", b)
	log.Printf("b = %+v", a)
	neon.VaddlS8(&add, &a, &b)
	neon.VmullS8(&mul, &a, &b)
	log.Printf("add = %+v", add)
	log.Printf("mul = %+v", mul)
}

Supported Operations

Only ARM Neon supported, for now.

Refer to the documentation in each respective file for more details on how to use each operation.

Contributing

Contributions to this project are welcome. To contribute, please follow these steps:

  1. Fork the repository.
  2. Create a new branch for your feature or bug fix.
  3. Make your changes and commit them with descriptive messages.
  4. Push your changes to your forked repository.
  5. Submit a pull request to the main repository.

Please ensure that your code follows the existing code style and includes appropriate tests.

Acknowledgments

  • The ARM NEON architecture documentation for providing valuable insights into SIMD programming techniques.
  • The open-source community for their contributions and inspiration.

Contact

For any questions or feedback regarding this repository, please feel free to contact me at alivan1627@gmail.com