This is the MongoDB driver for Erlang. MongoDB is a document-oriented database management system. A driver is a client library that provides an API for connecting to MongoDB servers, performing queries and updates on those servers, and performing administrative tasks like creating indexes and viewing statistics. This version of the driver supports connecting to a single server or replica set, and pooling of both types of connections. Both connection types and pools are thread-safe, i.e. multiple processes can use the same connection/pool simultaneously without interfering with each other. This driver is implemented as an Erlang application named mongodb. It depends on another Erlang library application named bson, which defines the document type and its standard binary representation. You need both of these. Below we describe the mongodb application; you should also see the bson application to understand the document type.
Download and compile the application
$ git clone git://github.com/comtihon/mongodb-erlang.git mongodb
$ cd mongodb
$ make
The bson application needs to be started before starting mongodb application
> application:start (bson).
The crypto application needs to be started if you plan to use authorization to mongodb-server 3+.
> application:start (crypto).
The mongodb application needs be started before using (to initialize an internal ets table of counters)
> application:start (mongodb).
Although the mongodb application includes several modules, you should only need to use top-level driver interfaces mongo or mongoc. mongo can be used for direct connections to a single mongod or mongos server. Use mongoc for connection to sharded or replica set Mongo deployments (As a matter of fact mongoc is built upon mongo module and also can be used for connections to a single server. Also it has built-in support of pooling connections so you won't be in any need of using extra pooling libraries like a poolboy). Likewise, you should only need to use the bson module in the bson application.
To connect to a database test
on mongodb server listening on localhost:27017
(or any address & port of your choosing)
use mc_worker_api:connect/1
.
> Database = <<"test">>.
> {ok, Connection} = mc_worker_api:connect ([{database, Database}]).
mc_worker_api:connect
returns {error, Reason}
if it failed to connect. See arguments you can pass in mc_worker_api.erl
type spec:
-type arg() :: {database, database()}
| {login, binary()}
| {password, binary()}
| {w_mode, write_mode()}
| {r_mode, read_mode()}
| {host, list()}
| {port, integer()}
| {register, atom() | fun()}
| {next_req_fun, fun()}.
To connect mc_worker in your supervised pool, use mc_worker:start_link/1
instead and pass all args to it.
safe
, along with {safe, GetLastErrorParams}
and unsafe
, are write-modes. Safe mode makes a getLastError request
after every write in the sequence. If the reply says it failed then the rest of the sequence is aborted and returns
{failure, {write_failure, Reason}}
, or {failure, not_master}
when connected to a slave. An example write
failure is attempting to insert a duplicate key that is indexed to be unique. Alternatively, unsafe mode issues every
write without a confirmation, so if a write fails you won't know about it and remaining operations will be executed.
This is unsafe but faster because you there is no round-trip delay.
master
, along with slave_ok
, are read-modes. master
means every query in the sequence must read fresh data (from
a master/primary server). If the connected server is not a master then the first read will fail, the remaining operations
will be aborted, and mongo:do
will return {failure, not_master}
. slave_ok
means every query is allowed to read
stale data from a slave/secondary (fresh data from a master is fine too).
If you set {register, Name}
option - mc_worker process will be registered on this Name, or you can pass function
fun(pid())
, which it runs with self pid.
If you set {login, Login}
and {password, Password}
options - mc_worker will try to authenticate to the database.
next_req_fun
is a function caller every time, when worker sends request to database. It can be use to optimise pool
usage. When you use poolboy transaction (or mongoc transaction, which use poolboy transaction) - mc_worker
sends request
to database and do nothing, waiting for reply. You can use {next_req_fun, fun() -> poolboy:checkin(?DBPOOL, self()) end}
to make workers return to pool as soon as they finish request. When responce from database comes back - it will be saved in
mc_worker msgbox. Msgbox will be processed just before the next call to mc_worker.
Notice, that poolboy's pool should be created with {strategy, fifo}
to make uniform usage of pool workers.
After you connected to your database - you can carry out write operations, such as insert
, update
and delete
:
> Collection = <<"test">>.
> mc_worker_api:insert(Connection, Collection, [
{<<"name">>, <<"Yankees">>, <<"home">>, {<<"city">>, <<"New York">>, <<"state">>, <<"NY">>}, <<"league">>, <<"American">>},
{<<"name">>, <<"Mets">>, <<"home">>, {<<"city">>, <<"New York">>, <<"state">>, <<"NY">>}, <<"league">>, <<"National">>},
{<<"name">>, <<"Phillies">>, <<"home">>, {<<"city">>, <<"Philadelphia">>, <<"state">>, <<"PA">>}, <<"league">>, <<"National">>},
{<<"name">>, <<"Red Sox">>, <<"home">>, {<<"city">>, <<"Boston">>, <<"state">>, <<"MA">>}, <<"league">>, <<"American">>}
]),
An insert example (from mongo_SUITE
test module). Connection
is your Connection, got from mc_worker_api:connect
, Collection
is your collection name, Doc
is something, you want to save.
Doc will be returned, if insert succeeded. If Doc doesn't contains _id
field - an updated Doc will be returned - with
automatically generated '_id' fields. If error occurred - Connection will fall.
> mc_worker_api:delete(Connection, Collection, Selector).
Delete example. Connection
is your Connection, Collection
- is a collection you want to clean. Selector
is the
rules for cleaning. If you want to clean everything - pass empty {}
.
You can also use maps instead bson documents:
> Collection = <<"test">>.
> mc_worker_api:insert(Connection, Collection, #{<<"name">> => <<"Yankees">>, <<"home">> =>
#{<<"city">> => <<"New York">>, <<"state">> => <<"NY">>}, <<"league">> => <<"American">>}),
To call read operations use find
, find_one
:
> Cursor = mc_worker_api:find(Connection, Collection, Selector)
All params similar to delete
.
The difference between find
and find_one
is in return. Find_one just returns your result, while find returns you a
Cursor
- special process' pid. You can query data through the process with the help of `mc_cursor' module.
> Result = mc_cursor:rest(Cursor),
> mc_cursor:close(Cursor),
Important! Do not forget to close cursors after using them!
To search for params - specify Selector
:
mc_worker_api:find_one(Connection, Collection, {<<"key">>, <<"123">>}).
will return one document from collection Collection with key == <<"123">>.
mc_worker_api:find_one(Connection, Collection, {<<"key">>, <<"123">>, <<"value">>, <<"built_in">>}).
will return one document from collection Collection with key == <<"123">> and
value == <<"built_in">>.
Tuples {<<"key">>, <<"123">>}
in first example and {<<"key">>, <<"123">>, <<"value">>, <<"built_in">>}
are selectors.
For filtering result - use Projector
:
mc_worker_api:find_one(Connection, Collection, {}, [{projector, {<<"value">>, true}]).
will return one document from collection Collection with fetching only
_id and value.
mc_worker_api:find_one(Connection, Collection, {}, [{projector, {<<"key">>, false, <<"value">>, false}}]).
will return your data without key and value params. If there is no other data - only _id will be returned.
Important! For empty projector use []
instead {}
. For empty selector use {}
.
To add or update field in document - use mc_worker_api:update
function with $set
param.
This updates selected fields:
Command = {<<"$set">>, {
<<"quantity">>, 500,
<<"details">>, {<<"model">>, "14Q3", <<"make">>, "xyz"},
<<"tags">>, ["coats", "outerwear", "clothing"]
}},
mc_worker_api:update(Connection, Collection, {<<"_id">>, 100}, Command),
This will add new field expired
, if there is no such field, and set it to true.
Command = {<<"$set">>, {<<"expired">>, true}},
mc_worker_api:update(Connection, Collection, {<<"_id">>, 100}, Command),
This will update fields in nested documents.
Command = {<<"$set">>, {<<"details.make">>, "zzz"}},
mc_worker_api:update(Connection, Collection, {<<"_id">>, 100}, Command),
This will update elements in array.
Command = {<<"$set">>, {
<<"tags.1">>, "rain gear",
<<"ratings.0.rating">>, 2
}},
mc_worker_api:update(Connection, Collection, {'_id', 100}, Command),
For result of executing this functions - see mongo_SUITE update test.
To create indexes - use mc_worker_api:ensure_index/3
command:
mc_worker_api:ensure_index(Connection, Collection, {<<"key">>, {<<"index">>, 1}}). %simple
mc_worker_api:ensure_index(Connection, Collection, {<<"key">>, {<<"index">>, 1}, <<"name">>, <<"MyI">>}). %advanced
mc_worker_api:ensure_index(Connection, Collection, {<<"key">>, {<<"index">>, 1}, <<"name">>, <<"MyI">>, <<"unique">>, true, <<"dropDups">>, true}). %full
ensure_index takes mc_worker
' pid or atom name as first parameter, collection, where to create index, as second
parameter and bson document with index
specification - as third parameter. In index specification one can set all or only some parameters.
If index specification is not full - it is automatically filled with values: name, Name, unique, false, dropDups, false
, where Name
is index's key.
This driver does not provide helper functions for commands. Use mc_worker_api:command
directly and refer to the
MongoDB documentation for how to issue raw commands.
To authenticate use function mc_worker_api:connect
, or mc_worker:start_link([...{login, <<"login">>}, {password, <<"password">>}...]
Login and password should be binaries!
By default timeout for all connections to connection gen_server is infinity
. If you found problems with it - you can
modify timeout.
To modify it just add mc_worker_call_timeout
with new value to your applications's env config.
Timeout for operations with cursors may be explicity passed to mc_cursor:next/2
, mc_cursor:take/3
, mc_cursor:rest/2
, and mc_cursor:foldl/5
functions, by default used value of cursor_timeout
from application config, or infinity
if cursor_timeout
not specified.
For pooling use Poolboy with mc_worker as pool workers.
You can use mongo_api.erl
for easy working with mongoc, or as an example of building your own api.
For opening connection to a MongoDB you can use this call of mongoc:connect method:
{ok, Topology} = mongoc:connect( Seed, Options, WorkerOptions )
% or
{ok, Topology} = mongo_api:connect(Type, Hosts, Options, WorkerOptions)
Where Seed contains information about host names and ports to connect and info about topology of MongoDB deployment.
So you can pass just a hostname with port (or tuple with single key) for connection to a single server deployment:
"hostname:27017"
{ single, "hostname:27017" }
If you want to connect to a replica set ReplicaSetName use this format of Seeds value:
{ rs, <<"ReplicaSetName">>, [ "hostname1:port1", "hostname2:port2"] }
To connect to a sharded cluster of mongos:
{ sharded, ["hostname1:port1", "hostname2:port2"] }
And if you want your MongoDB deployment metadata to be auto revered use unknow id in Seed tuple:
{ unknown, "hostname1:port1", "hostname2:port2"] }
Type in mongo_api:connect
is topology type (unknown
| sharded
).
mongoc topology Options
[
{ name, Name }, % Name should be used for mongoc pool to be registered with
{ register, Name }, % Name should be used for mongoc topology process to be registered with
{ pool_size, 5 }, % pool size on start
{ max_overflow, 10 }, % number of overflow workers be created, when all workers from pool are busy
{ overflow_ttl, 1000 }, % number of milliseconds for overflow workers to stay in pool before terminating
{ overflow_check_period, 1000 }, % overflow_ttl check period for workers (in milliseconds)
{ localThresholdMS, 1000 }, % secondaries only which RTTs fit in window from lower RTT to lower RTT + localThresholdMS could be selected for handling user's requests
{ connectTimeoutMS, 20000 },
{ socketTimeoutMS, 100 },
{ serverSelectionTimeoutMS, 30000 }, % max time appropriate server should be select by
{ waitQueueTimeoutMS, 1000 }, % max time for waiting worker to be available in the pool
{ heartbeatFrequencyMS, 10000 }, % delay between Topology rescans
{ minHeartbeatFrequencyMS, 1000 },
{ rp_mode, primary }, % default ReadPreference mode - primary, secondary, primaryPreferred, secondaryPreferred, nearest
{ rp_tags, [{tag,1}] }, % tags that servers shoul be tagged by for becoming candidates for server selection (may be an empty list)
]
mongoc WorkerOptions (as described in mongo Connecting chapter)
-type arg() :: {database, database()} % default database and it is also the initial db for auth purposes, later you can give other db name in requests
| {login, binary()}
| {password, binary()}
| {w_mode, write_mode()}.
Use transaction poolboy-like interface for mongoc:
mongoc:transaction_query(?DBPOOL,
fun(Conf) ->
mongoc:find_one(Conf, Collection, Key, Projector, 0)
end)
mongoc:transaction_query(?DBPOOL,
fun(Conf) ->
Res = mongoc:find_one(Conf, Collection, Key, Projector, 0),
mc_worker:hibernate(Conf),
Res
end)
mongoc:transaction(?DBPOOL, fun(Conf) -> mongoc:count(Conf, Collection, Value, [], 1) end, [])
mongoc:transaction(?DBPOOL,
fun(Worker) -> mc_worker_api:update(Worker, Collection, Key, Command, Upsert, Multi) end)
Notice, that all write operations like update
, insert
, delete
do with mongo, but all read operations
do with mongoc.
You can set up your read preferences when reading:
mongoc:transaction_query(?DBPOOL,
fun(Conf) ->
mongoc:find_one(Conf#{read_preference => secondaryPreferred}, Collection, Key, Projector, 0)
end)
API Docs - Documentation generated from source code comments