/franz-go

franz-go contains a feature complete, pure Go library for interacting with Kafka from 0.8.0 through 3.0.0+. Producing, consuming, transacting, administrating, etc.

Primary LanguageGoBSD 3-Clause "New" or "Revised" LicenseBSD-3-Clause

franz-go - A complete Apache Kafka client written in Go

GoDev GitHub GitHub tag (latest SemVer) Discord Chat

Franz-go is an all-encompassing Apache Kafka client fully written Go. This library aims to provide every Kafka feature from Apache Kafka v0.8.0 onward. It has support for transactions, regex topic consuming, the latest partitioning strategies, data loss detection, closest replica fetching, and more. If a client KIP exists, this library aims to support it.

This library attempts to provide an intuitive API while interacting with Kafka the way Kafka expects (timeouts, etc.).

Features

  • Feature complete client (Kafka >= 0.8.0 through v2.8.0+)
  • Full Exactly-Once-Semantics (EOS)
  • Idempotent & transactional producers
  • Simple (legacy) consumer
  • Group consumers with eager (roundrobin, range, sticky) and cooperative (cooperative-sticky) balancers
  • All compression types supported: gzip, snappy, lz4, zstd
  • SSL/TLS provided through custom dialer options
  • All SASL mechanisms supported (GSSAPI/Kerberos, PLAIN, SCRAM, and OAUTHBEARER)
  • Low-level admin functionality supported through a simple Request function
  • High-level admin package with many helper types to make cluster administration easy.
  • Utilizes modern & idiomatic Go (support for contexts, variadic configuration options, ...)
  • Highly performant by avoiding channels and goroutines where not necessary
  • Written in pure Go (no wrapper lib for a C library or other bindings)
  • Ability to add detailed log messages or metrics using hooks
  • Plug-in metrics support for prometheus, zap, etc.

Works with any Kafka compatible brokers:

  • Redpanda: the fastest and most efficient Kafka compatible event streaming platform
  • Kafka: the original Java project
  • Microsoft Event Hubs
  • Amazon MSK

Getting started

Here's a basic overview of producing and consuming:

seeds := []string{"localhost:9092"}
// One client can both produce and consume!
// Consuming can either be direct (no consumer group), or through a group. Below, we use a group.
cl, err := kgo.NewClient(
	kgo.SeedBrokers(seeds...),
	kgo.ConsumerGroup("my-group-identifier"),
	kgo.ConsumeTopics("foo"),
)
if err != nil {
	panic(err)
}
defer cl.Close()

ctx := context.Background()

// 1.) Producing a message
// All record production goes through Produce, and the callback can be used
// to allow for synchronous or asynchronous production.
var wg sync.WaitGroup
wg.Add(1)
record := &kgo.Record{Topic: "foo", Value: []byte("bar")}
cl.Produce(ctx, record, func(_ *Record, err error) {
	defer wg.Done()
	if err != nil {
		fmt.Printf("record had a produce error: %v\n", err)
	}

})
wg.Wait()

// Alternatively, ProduceSync exists to synchronously produce a batch of records.
if err := cl.ProduceSync(ctx, record).FirstErr(); err != nil {
	fmt.Printf("record had a produce error while synchronously producing: %v\n", err)
}

// 2.) Consuming messages from a topic
for {
	fetches := cl.PollFetches(ctx)
	if errs := fetches.Errors(); len(errs) > 0 {
		// All errors are retried internally when fetching, but non-retriable errors are
		// returned from polls so that users can notice and take action.
		panic(fmt.Sprint(errs))
	}

	// We can iterate through a record iterator...
	iter := fetches.RecordIter()
	for !iter.Done() {
		record := iter.Next()
		fmt.Println(string(record.Value), "from an iterator!")
	}

	// or a callback function.
	fetches.EachPartition(func(p kgo.FetchTopicPartition) {
		for _, record := range p.Records {
			fmt.Println(string(record.Value), "from range inside a callback!")
		}

		// We can even use a second callback!
		p.EachRecord(func(record *Record) {
			fmt.Println(string(record.Value), "from a second callback!")
		})
	})
}

This only shows producing and consuming in the most basic sense, and does not show the full list of options to customize how the client runs, nor does it show transactional producing / consuming. Check out the examples directory for more!

API reference documentation can be found on GoDev. Supplementary information can be found in the docs directory:

docs
├── admin requests — an overview of how to issue admin requests
├── package layout — describes the packages in franz-go
├── producing and consuming — descriptions of producing & consuming & the guarantees
└── transactions — a description of transactions and the safety even in a pre-KIP-447 world

Version Pinning

By default, the client issues an ApiVersions request on connect to brokers and defaults to using the maximum supported version for requests that each broker supports.

Kafka 0.10.0 introduced the ApiVersions request; if you are working with brokers older than that, you must use the kversions package. Use the MaxVersions option for the client if you do so.

As well, it is recommended to set the MaxVersions to the version of your broker cluster. Until KIP-584 is implemented, it is possible that if you do not pin a max version, this client will speak with some features to one broker while not to another when you are in the middle of a broker update roll.

Metrics & logging

Note there exists plug-in packages that allow you to easily add prometheus metrics, go-metrics, zap logging, etc. to your client! See the plugin directory for more information! These plugins are provided under dedicated modules, e.g. github.com/twmb/franz-go/plugin/kprom@v0.1.0.

The franz-go client takes a neutral approach to metrics by providing hooks that you can use to plug in your own metrics.

All connections, disconnections, reads, writes, and throttles can be hooked into, as well as per-batch produce & consume metrics. If there is an aspect of the library that you wish you could have insight into, please open an issue and we can discuss adding another hook.

Hooks allow you to log in the event of specific errors, or to trace latencies, count bytes, etc., all with your favorite monitoring systems.

In addition to hooks, logging can be plugged in with a general Logger interface. A basic logger is provided if you just want to write to a given file in a simple format. All logs have a message and then key/value pairs of supplementary information. It is recommended to always use a logger and to use LogLevelInfo.

See this example for an expansive example of integrating with prometheus! Alternatively, see this example for how to use the plug-in prometheus package!

Benchmarks

This client is quite fast; it is the fastest and most cpu and memory efficient client in Go.

For 100 byte messages,

  • This client is 4x faster at producing than confluent-kafka-go, and up to 10x-20x faster (at the expense of more memory usage) at consuming.

  • This client is 2.5x faster at producing than sarama, and 1.5x faster at consuming.

  • This client is 2.4x faster at producing than segment's kafka-go, and so much faster at consuming that I'm not sure I wrote the consuming comparison correctly here.

To check benchmarks yourself, see the bench example. This example lets you produce or consume to a cluster and see the byte / record rate. The compare subdirectory shows comparison code.

Supported KIPs

Theoretically, this library supports every (non-Java-specific) client facing KIP. Any KIP that simply adds or modifies a protocol is supported by code generation.

KIP Kafka release Status
KIP-1 — Disallow acks > 1 0.8.3 Supported & Enforced
KIP-4 — Request protocol changes 0.9.0 through 0.10.1 Supported
KIP-8 — Flush method on Producer 0.8.3 Supported
KIP-12 — SASL & SSL 0.9.0 Supported
KIP-13 — Throttling (on broker) 0.9.0 Supported
KIP-15 — Close with a timeout 0.9.0 Supported (via context)
KIP-19 — Request timeouts 0.9.0 Supported
KIP-22 — Custom partitioners 0.9.0 Supported
KIP-31 — Relative offsets in message sets 0.10.0 Supported
KIP-32 — Timestamps in message set v1 0.10.0 Supported
KIP-35 — ApiVersion 0.10.0 Supported
KIP-40 — ListGroups and DescribeGroups 0.9.0 Supported
KIP-41 — max.poll.records 0.10.0 Supported (via PollRecords)
KIP-42 — Producer & consumer interceptors 0.10.0 Partial support (hooks)
KIP-43 — SASL PLAIN & handshake 0.10.0 Supported
KIP-48 — Delegation tokens 1.1.0 Supported
KIP-54 — Sticky partitioning 0.11.0 Supported
KIP-57 — Fix lz4 0.10.0 Supported
KIP-62 — background heartbeats & improvements 0.10.1 Supported
KIP-70 — On{Assigned,Revoked} 0.10.1 Supported
KIP-74 — Fetch response size limits 0.10.1 Supported
KIP-78 — ClusterID in Metadata 0.10.1 Supported
KIP-79 — List offsets for times 0.10.1 Supported
KIP-81 — Bound fetch memory usage WIP Supported (through a combo of options)
KIP-82 — Record headers 0.11.0 Supported
KIP-84 — SASL SCRAM 0.10.2 Supported
KIP-86 — SASL Callbacks 0.10.2 Supported (through callback fns)
KIP-88 — OffsetFetch for admins 0.10.2 Supported
KIP-91 — Intuitive producer timeouts 2.1.0 Supported (as a matter of opinion)
KIP-97 — Backwards compat for old brokers 0.10.2 Supported
KIP-98 — EOS 0.11.0 Supported
KIP-101 — OffsetForLeaderEpoch v0 0.11.0 Supported
KIP-102 — Consumer close timeouts 0.10.2 Supported (via context)
KIP-107 — DeleteRecords 0.11.0 Supported
KIP-108 — CreateTopic validate only field 0.10.2 Supported
KIP-110 — zstd 2.1.0 Supported
KIP-112 — Broker request protocol changes 1.0.0 Supported
KIP-113 — LogDir requests 1.0.0 Supported
KIP-117 — Admin client 0.11.0 Supported (via kmsg)
KIP-124 — Request rate quotas 0.11.0 Supported
KIP-126 — Ensure proper batch size after compression 0.11.0 Supported (avoided entirely)
KIP-133 — Describe & Alter configs 0.11.0 Supported
KIP-140 — ACLs 0.11.0 Supported
KIP-144 — Broker reconnect backoff 0.11.0 Supported
KIP-152 — More SASL; SASLAuthenticate 1.0.0 Supported
KIP-183 — Elect preferred leaders 2.2.0 Supported
KIP-185 — Idempotency is default 1.0.0 Supported
KIP-192 — Cleaner idempotence semantics 1.0.0 Supported
KIP-195 — CreatePartitions 1.0.0 Supported
KIP-204 — DeleteRecords via admin API 1.1.0 Supported
KIP-207 — New error in ListOffsets 2.2.0 Supported
KIP-219 — Client-side throttling 2.0.0 Supported
KIP-222 — Group operations via admin API 2.0.0 Supported
KIP-226 — Describe configs v1 1.1.0 Supported
KIP-227 — Incremental fetch 1.1.0 Supported
KIP-229 — DeleteGroups 1.1.0 Supported
KIP-249 — Delegation tokens in admin API 2.0.0 Supported
KIP-255 — SASL OAUTHBEARER 2.0.0 Supported
KIP-266 — Fix indefinite consumer timeouts 2.0.0 Supported (via context)
KIP-279 — OffsetForLeaderEpoch bump 2.0.0 Supported
KIP-289 — Default group.id to null 2.2.0 Supported
KIP-294 — TLS verification 2.0.0 Supported (via dialer)
KIP-302 — Use multiple addrs for resolved hostnames 2.1.0 Supported (via dialer)
KIP-320 — Fetcher: detect log truncation 2.1.0 Supported
KIP-322 — DeleteTopics disabled error code 2.1.0 Supported
KIP-339 — IncrementalAlterConfigs 2.3.0 Supported
KIP-341 — Sticky group bugfix ? Supported
KIP-342 — OAUTHBEARER extensions 2.1.0 Supported
KIP-345 — Static group membership 2.4.0 Supported
KIP-357 — List ACLs per principal via admin API 2.1.0 Supported
KIP-360 — Safe epoch bumping for UNKNOWN_PRODUCER_ID 2.5.0 Supported
KIP-361 — Allow disable auto topic creation 2.3.0 Supported
KIP-368 — Periodically reauthenticate SASL 2.2.0 Supported
KIP-369 — An always round robin produce partitioner 2.4.0 Supported
KIP-380 — Inter-broker protocol changes 2.2.0 Supported
KIP-389 — Group max size error 2.2.0 Supported
KIP-392 — Closest replica fetching w/ rack 2.2.0 Supported
KIP-394 — Require member.id for initial join request 2.2.0 Supported
KIP-396 — Commit offsets manually 2.4.0 Supported
KIP-412 — Dynamic log levels w/ IncrementalAlterConfigs 2.4.0 Supported
KIP-429 — Incremental rebalance (see KAFKA-8179) 2.4.0 Supported
KIP-430 — Authorized ops in DescribeGroups 2.3.0 Supported
KIP-447 — Producer scalability for EOS 2.5.0 Supported
KIP-455 — Replica reassignment API 2.4.0 Supported
KIP-460 — Leader election API 2.4.0 Supported
KIP-464 — CreateTopic defaults 2.4.0 Supported
KIP-467 — Per-record error codes when producing 2.4.0 Supported (and ignored)
KIP-480 — Sticky partition producing 2.4.0 Supported
KIP-482 — Tagged fields (KAFKA-8885) 2.4.0 Supported
KIP-496 — OffsetDelete admin command 2.4.0 Supported
KIP-497 — New AlterISR API 2.7.0 Supported
KIP-498 — Max bound on reads ? Supported
KIP-511 — Client name/version in ApiVersions request 2.4.0 Supported
KIP-514 — Bounded Flush 2.4.0 Supported (via context)
KIP-516 — Topic IDs ??? Supported as it is implemented
KIP-518 — List groups by state 2.6.0 Supported
KIP-519 — Configurable SSL "engine" 2.6.0 Supported (via dialer)
KIP-525 — CreateTopics v5 returns configs 2.4.0 Supported
KIP-526 — Reduce metadata lookups 2.5.0 Supported
KIP-533 — Default API timeout (total time, not per request) 2.5.0 Supported (via RetryTimeout)
KIP-546 — Client Quota APIs 2.5.0 Supported
KIP-554 — Broker side SCRAM APIs 2.7.0 Supported
KIP-559 — Protocol info in sync/join 2.5.0 Supported
KIP-568 — Explicit rebalance triggering on the consumer 2.6.0 Supported
KIP-569 — Docs & type in DescribeConfigs 2.6.0 Supported
KIP-570 — Leader epoch in StopReplica 2.6.0 Supported
KIP-580 — Exponential backoff 2.6.0 Supported
KIP-584 — Versioning scheme for features ? Supported (nothing to do yet)
KIP-588 — Producer recovery from txn timeout 2.7.0 Supported
KIP-590 — Envelope (broker only) 2.7.0 Supported
KIP-595 — New APIs for raft protocol 2.7.0 Supported
KIP-599 — Throttling on create/delete topic/partition 2.7.0 Supported
KIP-602 — Use all resolved addrs by default 2.6.0 Supported (via dialer)
KIP-651 — Support PEM 2.7.0 Supported (via dialer)
KIP-654 — Aborted txns with unflushed data is not fatal 2.7.0 Supported (default behavior)
KIP-664 — Describe producers / etc. 2.8.0 (mostly) Supported
KIP-679 — Strongest producer guarantee by default 3.0.0 Supported (by default always)
KIP-699 — Batch FindCoordinators 3.0.0 Supported
KIP-700 — DescribeCluster 2.8.0 Supported
KIP-709 — Batch OffsetFetch 3.0.0 Supported
KIP-730 - AllocateProducerIDs 3.0.0 Supported
KIP-734 — Support MaxTimestamp in ListOffsets 3.0.0 Supported (simple version bump)
KIP-735 — Bump default session timeout ? Supported

Missing from above but included in librdkafka is:

  • KIP-85, which does not seem relevant for franz-go
  • KIP-92 for consumer lag metrics, which is better suited for an external system via the admin api
  • KIP-223 for more metrics
  • KIP-235, which is confusing but may be implement via a custom dialer and custom kerberos?
  • KIP-359 to verify leader epoch when producing; this is easy to support but actually is not implemented in Kafka yet
  • KIP-421 for dynamic values in configs; librdkafka mentions it does not support it, and neither does franz-go for the same reason (we do not use a config file)
  • KIP-436 is about yet another metric
  • KIP-517, more metrics