The Solr Operator manages Apache Solr Clouds within Kubernetes. It is built on top of the Kube Builder framework.
The project is currently in beta (v1beta1
), and while we do not anticipate changing the API in backwards-incompatible ways there is no such guarantee yet.
If you run into issues using the Solr Operator, please:
- Reference the version compatibility and upgrade notes provided below
- Create a Github Issue in this repo, describing your problem with as much detail as possible
- Reach out on our Slack channel!
Join us on the #solr-operator channel in the official Kubernetes slack workspace.
- Getting Started
- Contributions
- Version Compatibility and Upgrade Notes
- License
- Code of Conduct
- Security Vulnerability Reporting
Install the Zookeeper & Etcd Operators, which this operator depends on by default. Each is optional, as described in the Zookeeper section.
$ kubectl apply -f example/dependencies
Install the Solr CRDs & Operator
$ make install deploy
Make sure that the solr-operator and a zookeeper-operator are running.
Create an example Solr cloud, with the following configuration.
$ cat example/test_solrcloud.yaml
apiVersion: solr.bloomberg.com/v1beta1
kind: SolrCloud
metadata:
name: example
spec:
replicas: 4
solrImage:
tag: 8.1.1
Apply it to your Kubernetes cluster.
$ kubectl apply -f example/test_solrcloud.yaml
$ kubectl get solrclouds
NAME VERSION DESIREDNODES NODES READYNODES AGE
example 8.1.1 4 2 1 2m
$ kubectl get solrclouds
NAME VERSION DESIREDNODES NODES READYNODES AGE
example 8.1.1 4 4 4 8m
Increase the number of Solr nodes in your cluster.
$ kubectl scale --replicas=5 solrcloud/example
Decrease the number of Solr nodes in your cluster.
$ kubectl delete solrcloud example
What actually gets created when the Solr Cloud is spun up?
$ kubectl get all
NAME READY STATUS RESTARTS AGE
pod/example-solrcloud-0 1/1 Running 7 47h
pod/example-solrcloud-1 1/1 Running 6 47h
pod/example-solrcloud-2 1/1 Running 0 47h
pod/example-solrcloud-3 1/1 Running 6 47h
pod/example-solrcloud-zk-0 1/1 Running 0 49d
pod/example-solrcloud-zk-1 1/1 Running 0 49d
pod/example-solrcloud-zk-2 1/1 Running 0 49d
pod/example-solrcloud-zk-3 1/1 Running 0 49d
pod/example-solrcloud-zk-4 1/1 Running 0 49d
pod/solr-operator-8449d4d96f-cmf8p 1/1 Running 0 47h
pod/zk-operator-674676769c-gd4jr 1/1 Running 0 49d
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
service/example-solrcloud-0 ClusterIP ##.###.###.## <none> 80/TCP 47h
service/example-solrcloud-1 ClusterIP ##.###.##.# <none> 80/TCP 47h
service/example-solrcloud-2 ClusterIP ##.###.###.## <none> 80/TCP 47h
service/example-solrcloud-3 ClusterIP ##.###.##.### <none> 80/TCP 47h
service/example-solrcloud-common ClusterIP ##.###.###.### <none> 80/TCP 47h
service/example-solrcloud-headless ClusterIP None <none> 80/TCP 47h
service/example-solrcloud-zk-client ClusterIP ##.###.###.### <none> 21210/TCP 49d
service/example-solrcloud-zk-headless ClusterIP None <none> 22210/TCP,23210/TCP 49d
NAME READY UP-TO-DATE AVAILABLE AGE
deployment.apps/solr-operator 1/1 1 1 49d
deployment.apps/zk-operator 1/1 1 1 49d
NAME DESIRED CURRENT READY AGE
replicaset.apps/solr-operator-8449d4d96f 1 1 1 2d1h
replicaset.apps/zk-operator-674676769c 1 1 1 49d
NAME READY AGE
statefulset.apps/example-solrcloud 4/4 47h
statefulset.apps/example-solrcloud-zk 5/5 49d
NAME HOSTS PORTS AGE
ingress.extensions/example-solrcloud-common default-example-solrcloud.test.domain,default-example-solrcloud-0.test.domain + 3 more... 80 2d2h
NAME VERSION DESIREDNODES NODES READYNODES AGE
solrcloud.solr.bloomberg.com/example 8.1.1 4 4 4 47h
Solr Clouds require an Apache Zookeeper to connect to.
The Solr operator gives a few options.
Note - Both options below come with options to specify a chroot
, or a ZNode path for solr to use as it's base "directory" in Zookeeper. Before the operator creates or updates a StatefulSet with a given chroot
, it will first ensure that the given ZNode path exists and if it doesn't the operator will create all necessary ZNodes in the path. If no chroot is given, a default of /
will be used, which doesn't require the existence check previously mentioned. If a chroot is provided without a prefix of /
, the operator will add the prefix, as it is required by Zookeeper.
This is an external/internal connection string as well as an optional chRoot to an already running Zookeeeper ensemble. If you provide an external connection string, you do not have to provide an internal one as well.
If you do not require the Solr cloud to run cross-kube cluster, and do not want to manage your own Zookeeper ensemble, the solr-operator can manage Zookeeper ensemble(s) for you.
Using the zookeeper-operator, a new Zookeeper ensemble can be spun up for each solrCloud that has this option specified.
The startup parameter zookeeper-operator
must be provided on startup of the solr-operator for this parameter to be available.
Using etcd-operator, a new Etcd ensemble can be spun up for each solrCloud that has this option specified. A Zetcd deployment is also created so that Solr can interact with Etcd as if it were a Zookeeper ensemble.
The startup parameter etcd-operator
must be provided on startup of the solr-operator for this parameter to be available.
Solr-operator can manage the creation, deletion and modification of Solr collections.
Collection creation requires a Solr Cloud to apply against. Presently, SolrCollection supports both implicit and compositeId router types, with some of the basic configuration options including autoAddReplicas
.
Create an example set of collections against on the "example" solr cloud
$ cat example/test_solrcollection.yaml
apiVersion: solr.bloomberg.com/v1beta1
kind: SolrCollection
metadata:
name: example-collection-1
spec:
solrCloud: example
collection: example-collection
routerName: compositeId
autoAddReplicas: false
numShards: 2
replicationFactor: 1
maxShardsPerNode: 1
collectionConfigName: "_default"
---
apiVersion: solr.bloomberg.com/v1beta1
kind: SolrCollection
metadata:
name: example-collection-2-compositeid-autoadd
spec:
solrCloud: example
collection: example-collection-2
routerName: compositeId
autoAddReplicas: true
numShards: 2
replicationFactor: 1
maxShardsPerNode: 1
collectionConfigName: "_default"
---
apiVersion: solr.bloomberg.com/v1beta1
kind: SolrCollection
metadata:
name: example-collection-3-implicit
spec:
solrCloud: example
collection: example-collection-3-implicit
routerName: implicit
autoAddReplicas: true
numShards: 2
replicationFactor: 1
maxShardsPerNode: 1
shards: "fooshard1,fooshard2"
collectionConfigName: "_default"
$ kubectl apply -f examples/test_solrcollections.yaml
The solr-operator supports the full lifecycle of standard aliases. Here is an example pointing an alias to 2 collections
apiVersion: solr.bloomberg.com/v1beta1
kind: SolrCollectionAlias
metadata:
name: collection-alias-example
spec:
solrCloud: example
aliasType: standard
collections:
- example-collection-1
- example-collection-2
Aliases can be useful when migrating from one collection to another without having to update application endpoint configurations.
Routed aliases are presently not supported
Solr backups require 3 things:
- A solr cloud running in kubernetes to backup
- The list of collections to backup
- A shared volume reference that can be written to from many clouds
- This could be a NFS volume, a persistent volume claim (that has
ReadWriteMany
access), etc. - The same volume can be used for many solr clouds in the same namespace, as the data stored within the volume is namespaced.
- This could be a NFS volume, a persistent volume claim (that has
- A way to persist the data. The currently supported persistence methods are:
- A volume reference (this does not have to be
ReadWriteMany
) - An S3 endpoint.
- A volume reference (this does not have to be
Backups will be tarred before they are persisted.
There is no current way to restore these backups, but that is in the roadmap to implement.
Solr metrics can be collected from solr clouds/standalone solr both residing within the kubernetes cluster and outside. To use the Prometheus exporter, the easiest thing to do is just provide a reference to a Solr instance. That can be any of the following:
- The name and namespace of the Solr Cloud CRD
- The Zookeeper connection information of the Solr Cloud
- The address of the standalone Solr instance
You can also provide a custom Prometheus Exporter config, Solr version, and exporter options as described in the Solr ref-guide.
Note that a few of the official Solr docker images do not enable the Prometheus Exporter.
Versions 6.6
- 7.x
and 8.2
- master
should have the exporter available.
The solr-operator will work with any of the official Solr images currently available.
The solr-operator supports private Docker repo access for Solr images you may want to store in a private Docker repo. It is recommended to source your image from the official Solr images.
Using a private image requires you have a K8s secret preconfigured with appropreiate access to the image. (type: kubernetes.io/dockerconfigjson)
apiVersion: solr.bloomberg.com/v1beta1
kind: SolrCloud
metadata:
name: example-private-repo-solr-image
spec:
replicas: 3
solrImage:
repository: myprivate-repo.jfrog.io/solr
tag: 8.2.0
imagePullSecret: "k8s-docker-registry-secret"
- -zookeeper-operator Whether or not to use the Zookeeper Operator to create dependent Zookeeepers.
Required to use the
ProvidedZookeeper.Zookeeper
option within the Spec. If true, then a Zookeeper Operator must be running for the cluster. ( true | false , defaults to false) - -etcd-operator Whether or not to use the Etcd Operator to create dependent Zetcd clusters.
Required to use the
ProvidedZookeeper.Zetcd
option within the Spec. If true, then an Etcd Operator must be running for the cluster. ( true | false , defaults to false) - -ingress-base-domain If you desire to make solr externally addressable via ingresses, a base ingress domain is required.
Solr Clouds will be created with ingress rules at
*.(ingress-base-domain)
. ( optional , e.g.ing.base.domain
)
The CRD should be updated anytime you update the API.
$ make manifests
Beware that you must be running an updated version of controller-gen
. To update to a compatible version, run:
$ go get sigs.k8s.io/controller-tools/cmd/controller-gen@v0.2.2
Make sure that you have updated the go.mod file:
$ make mod-tidy
Two Docker images are published to DockerHub, both based off of the same base image.
- Builder Image - Downloads gomod dependencies, builds operator executable (This is not published, only used to build the following images)
- Slim Image - Contains only the operator executable
- Vendor Image - Contains the operator executable as well as all dependencies (at
/solr-operator-vendor-sources
)
Building and releasing a test operator image with a custom namespace.
$ NAMESPACE=your-namespace make docker-build docker-push
You can test the vendor docker container by running
$ NAMESPACE=your-namespace make docker-vendor-build docker-vendor-push
- (Download and install latest stable version)[https://docs.docker.com/docker-for-mac/install/]
- Enable K8s under perferences
- Ensure you have kubectl installed on your Mac (if using Brew:
brew install kubernetes-cli
- Run through Getting Started
- Install nginx-ingress configuration
kubectl apply -f https://raw.githubusercontent.com/kubernetes/ingress-nginx/master/deploy/static/mandatory.yaml
kubectl apply -f https://raw.githubusercontent.com/kubernetes/ingress-nginx/master/deploy/static/provider/cloud-generic.yaml
- Ensure your /etc/hosts file is setup to use the ingress for your SolrCloud. Here is what you need if you name your SolrCloud 'example' with 3 replicas
127.0.0.1 localhost default-example-solrcloud.ing.local.domain ing.local.domain default-example-solrcloud-0.ing.local.domain default-example-solrcloud-1.ing.local.domain default-example-solrcloud-2.ing.local.domain dinghy-ping.localhost
- Navigate to your browser: http://default-example-solrcloud.ing.local.domain/solr/#/ to validate everything is working
- The zkConnectionString used for provided zookeepers changed from using the string provided in the
ZkCluster.Status
, which used an IP, to using the service name. This will cause a rolling restart of your solrs using the provided zookeeper option, but there will be no data loss.
- Uses
gomod
instead ofdep
SolrCloud.zookeeperRef.provided.zookeeper.persistentVolumeClaimSpec
has been deprecated in favor of theSolrCloud.zookeeperRef.provided.zookeeper.persistence
option.
This option is backwards compatible, but will be removed in a future version.- An upgrade to the ZKOperator version
0.2.4
is required.
SolrCloud.Spec.persistentVolumeClaim
was renamed toSolrCloud.Spec.dataPvcSpec
- v1.10 - CustomResourceSubresources
We ❤️ contributions.
Have you had a good experience with ? Why not share some love and contribute code, or just let us know about any issues you had with it?
We welcome issue reports here; be sure to choose the proper issue template for your issue, so that we can be sure you're providing the necessary information.
Before sending a Pull Request, please make sure you read our Contribution Guidelines.
Please read the LICENSE file here.
This project has adopted a Code of Conduct. If you have any concerns about the Code, or behavior which you have experienced in the project, please contact us at opensource@bloomberg.net.
If you believe you have identified a security vulnerability in this project, please send email to the project team at opensource@bloomberg.net, detailing the suspected issue and any methods you've found to reproduce it.
Please do NOT open an issue in the GitHub repository, as we'd prefer to keep vulnerability reports private until we've had an opportunity to review and address them.