The sole purpose of this project is to help understand how libuv works.
$ git clone git@github.com:libuv/libuv.git
$ cd libuv
Build with debugging symbols:
$ mkdir build && cd build
$ cmake -DCMAKE_BUILD_TYPE=Debug ..
$ make
The make target names match the source files but with out the suffix.
So to compile check.c
use:
$ make check
And the run the executable:
$ ./check
And you can set through using:
$ lldb ./check
You might want to add libuv/test/.libs/run-tests to your firewall settings.
Libuv provides the following features:
- Event loop
- Timers
- TCP/UDP sockets
- Named pipes
- Filesystem operations
- Signal handling
- Child processes
- TTY
- Threading utilities
Libuv uses the best available options for different operating systems to perform operations related to process, signal, timer and file descriptors. On Linux epoll is used and on MacOSX kqueue, and on Windows GetQueuedCompletionStatusEx.
Libuv uses structs to implement inheritance. Here is an non-libuv example. For example, in libuv we have structs that inherit from uv_stream_t, like uv_tcp_t, uv_pipe_t:
struct uv_stream_s {
UV_HANDLE_FIELDS
UV_STREAM_FIELDS
};
struct uv_tcp_s {
UV_HANDLE_FIELDS
UV_STREAM_FIELDS
UV_TCP_PRIVATE_FIELDS
};
struct uv_pipe_s {
UV_HANDLE_FIELDS
UV_STREAM_FIELDS
int ipc; /* non-zero if this pipe is used for passing handles */
UV_PIPE_PRIVATE_FIELDS
};
This type is the main object where things happen. It runs in a single thread. If you want more threads you would run one instance of this type per thread.
This type represents a resource. For example, a timer handle would be reponsible for calling a callback when the timer expires. Or a tcp handle would be responsible for reading/writing.
This type represents operations. These request can use handles, for example a request to write to a stream would use a stream handle to do so. But there does not have to be a handle.
uv_loop_t* loop = uv_default_loop();
uv_loop_init(loop);
In src/unix/loop.c
we have:
int uv_loop_init(uv_loop_t* loop) {
...
QUEUE_INIT(&loop->wq);
...
}
Notice that the address of loop->wq is being passed into the macro (src/queue.h)
as its sole argument. And wq
is defined as:
void* wq[2];
So wq
is an array with two elements which are of type pointer to void. So,
they can contain any pointer but will need to be casted to the correct type when
used.
#define QUEUE_INIT(q) \
do {
QUEUE_NEXT(q) = (q);
QUEUE_PREV(q) = (q);
}
So QUEUE_NEXT
would be expanded to:
QUEUE_NEXT(&loop->wq) = (&loop->wq);
And QUEUE_NEXT
looks like this:
#define QUEUE_NEXT(q) (*(QUEUE **) &((*(q))[0]))
And if we replace q with the passed in argument &loop->wp
we get:
#define QUEUE_NEXT(&loop->wp) (*(QUEUE **) &((*(&loop->wp))[0]))
So the first line of the QUEUE_INIT
macro will become:
(*(QUEUE **) &((*(&loop->wq))[0])) = &loop->wp;
First, the we dereferece the passed-in &loop->wp
which gives the address of
the array.
(*(QUEUE **) &(loop->wq[0])) = &loop->wp;
Now, &(loop->wq[0]) is the address of first member of the array and this will be of type
void*`, and we are getting the address of that:
auto first = &((*(&s))[0]);
(lldb) expr first
(void ** ) $0 = 0x00007fffffffd1d0
We are then casting that to QUEUE** and then dereferencing.
I'm not understading the reason for this cast. If the first
above is already
of type void** the additional task does nothing. I'm probably missing something
here, but there is an example in queue.c that tries to sort
this out. It also show an example usage of QUEUE which should shed some light
on why these two element arrays are used. The are a member of a struct and the
first entry points to the next element in the queue, and the second the previous.
QUEUE q = {NULL, NULL};
QUEUE_INIT(&q);
(lldb) expr q
(QUEUE) $0 = ([0] = 0x0000000000000000, [1] = 0x0000000000000000)
(lldb) s
(lldb) expr q
(QUEUE) $1 = ([0] = 0x00007fffffffd190, [1] = 0x00007fffffffd190)
(lldb) expr &q
(QUEUE *) $2 = 0x00007fffffffd190
Notice that both entries are set to the address of of q
.
Next, we insert two elements into the queue:
QUEUE_INSERT_TAIL(&q, &loop.wp);
QUEUE_INSERT_TAIL(&q, &loop.wp);
(lldb) expr q
(QUEUE) $4 = ([0] = 0x00007fffffffd180, [1] = 0x00007fffffffd180)
(lldb) s
(lldb) expr q
(QUEUE) $5 = ([0] = 0x00007fffffffd180, [1] = 0x00007fffffffd160)
(lldb) expr &fletch.wp
(void *(*)[2]) $9 = 0x00007fffffffd180
(lldb) expr &rosen.wp
(void *(*)[2]) $10 = 0x00007fffffffd160
Notice that the struct has a queue memmber (or in this case just a member of type void* wt[2].
To initialize a handle there is a common pattern. For example, if we look at the check example we find:
uv_loop_t* loop = uv_default_loop();
uv_loop_init(loop);
uv_check_t check;
uv_check_init(loop, &check);
What does uv_check_init
actually do?
We have to look in src/unix/loop-watcher.c
:
UV_LOOP_WATCHER_DEFINE(check, CHECK)
Now, the UV_LOOP_WATCHER_DEFINE
macro is defined in the same file:
int uv_check_init(uv_loop_t* loop, uv_check_t* handle) {
uv__handle_init(loop, (uv_handle_t*)handle, UV_CHECK);
handle->check_cb = NULL;
return 0;
}
So that looks simple enough, but we need to keep in mind that uv__handle_init
is also a macro which is defined in src/uv-common.h
.
#define uv__handle_init(loop_, h, type_) \
do { \
(h)->loop = (loop_); \
(h)->type = (type_); \
(h)->flags = UV_HANDLE_REF; /* Ref the loop when active. */ \
QUEUE_INSERT_TAIL(&(loop_)->handle_queue, &(h)->handle_queue); \
uv__handle_platform_init(h);
}
while (0)
And QUEUE_INSERT_TAIL
is also a macro (can be found in src/queue.h
):
#define QUEUE_INSERT_TAIL(h, q) \
do { \
QUEUE_NEXT(q) = (h); \
QUEUE_PREV(q) = QUEUE_PREV(h); \
QUEUE_PREV_NEXT(q) = (q); \
QUEUE_PREV(h) = (q); \
} \
while (0)
Notice that h
is the *(loop_)->handle_queue, and that q
is the &(h)->handle_queue.
First, lets just take a look at the loop struct as a reference:
struct uv_loop_s {
/* User data - use this for whatever. */
void* data;
/* Loop reference counting. */
unsigned int active_handles;
void* handle_queue[2];
union {
void* unused[2];
unsigned int count;
} active_reqs;
/* Internal flag to signal loop stop. */
unsigned int stop_flag;
unsigned long flags; \
int backend_fd; \
void* pending_queue[2]; \
void* watcher_queue[2]; \
uv__io_t** watchers; \
unsigned int nwatchers; \
unsigned int nfds; \
void* wq[2]; \
uv_mutex_t wq_mutex; \
uv_async_t wq_async; \
uv_rwlock_t cloexec_lock; \
uv_handle_t* closing_handles; \
void* process_handles[2]; \
void* prepare_handles[2]; \
void* check_handles[2]; \
void* idle_handles[2]; \
void* async_handles[2]; \
void (*async_unused)(void); /* TODO(bnoordhuis) Remove in libuv v2. */ \
uv__io_t async_io_watcher; \
int async_wfd; \
struct { \
void* min; \
unsigned int nelts; \
} timer_heap; \
uint64_t timer_counter; \
uint64_t time; \
int signal_pipefd[2]; \
uv__io_t signal_io_watcher; \
uv_signal_t child_watcher; \
int emfile_fd; \
UV_PLATFORM_LOOP_FIELDS \
}
When using the default loop a reference to a struct of type uv_loop_s is passed
in to uv_loop_init
.
heap_init((struct heap*) &loop->timer_heap);
QUEUE_INIT(&loop->wq);
QUEUE_INIT(&loop->idle_handles);
QUEUE_INIT(&loop->async_handles);
QUEUE_INIT(&loop->check_handles);
QUEUE_INIT(&loop->prepare_handles);
QUEUE_INIT(&loop->handle_queue);
Next,
&((uv_handle_t*)handle)->handle_queue))[0])
&((*(&((uv_handle_t*)handle)->handle_queue))[0])
We can see the complete function expanded by the preprocesor using the following command:
$ gcc -I./include -I./src -E src/unix/loop-watcher.c
int uv_check_init(uv_loop_t* loop, uv_check_t* handle) {
do {
((uv_handle_t*)handle)->loop = (loop);
((uv_handle_t*)handle)->type = (UV_CHECK);
((uv_handle_t*)handle)->flags = UV_HANDLE_REF;
do {
(*(QUEUE **) &((*(&((uv_handle_t*)handle)->handle_queue))[0])) = (&(loop)->handle_queue);
(*(QUEUE **) &((*(&((uv_handle_t*)handle)->handle_queue))[1])) = (*(QUEUE **) &((*(&(loop)->handle_queue))[1]));
((*(QUEUE **) &((*((*(QUEUE **) &((*(&((uv_handle_t*)handle)->handle_queue))[1]))))[0]))) = (&((uv_handle_t*)handle)->handle_queue);
(*(QUEUE **) &((*(&(loop)->handle_queue))[1])) = (&((uv_handle_t*)handle)->handle_queue);
} while (0);
(((uv_handle_t*)handle)->next_closing = ((void *)0);
} while (0);
handle->check_cb = ((void *)0);
return 0;
}
struct uv_loop_s {
/* User data - use this for whatever. */
void* data;
/* Loop reference counting. */
unsigned int active_handles;
void* handle_queue[2];
union {
void* unused[2];
unsigned int count;
} active_reqs;
/* Internal flag to signal loop stop. */
unsigned int stop_flag;
UV_LOOP_PRIVATE_FIELDS
};
#define UV_HANDLE_FIELDS \
/* public */ \
void* data; \
/* read-only */ \
uv_loop_t* loop; \
uv_handle_type type; \
/* private */ \
uv_close_cb close_cb; \
void* handle_queue[2]; \
union { \
int fd; \
void* reserved[4]; \
} u; \
UV_HANDLE_PRIVATE_FIELDS
+------------+ +----------+ +-----------+
| uv_tcp_t | | uv_pipe_t| | uv_tty_t |
+------------+ +----------+ +-----------+
/|\
+---------------------------------------+ +----------+ +-----------+
| uv_stream_t | | uv_udp_t | | uv_poll_t |
+---------------------------------------+ +----------+ +-----------+
+------------------------------------------------------------------+
| uv__io_t |
+------------------------------------------------------------------+
+------------------------------------------------------------------+
| Operating System |
+------------------------------------------------------------------+
uv_tcp_t
, uv_pipe_t
, uv_tty_t
, uv_udp_t
, and uv_poll_t
is what an
application uses.
uv_tcp_t
, uv_pipe_t
, uv_tty_t
are all streams. The common functionality
for these can be found in src/unix/stream.c
.
+------------+ +----------+ +-----------------+ +------------------+
| uv_fs_t | | uv_work_t| | uv_getaddrinfo_t| | uv_getnameinfo_t |
+------------+ +----------+ +-----------------+ +------------------+
+------------------------------------------------------------------+
| uv_work_t |
+------------------------------------------------------------------+
+------------------------------------------------------------------+
| thread pool |
+------------------------------------------------------------------+
uv_fs_t
, uv_work_t
, uv_getaddrsinfo_t
, uv_getnameinfo_t
is what an
application uses. These all run on a thread pool to perform there operations.
The thread pool is only used for file I/0 and for getaddrinfo!
The default size of the thread pool is 4 (uv_threadpool_size).
Just to be clear, the libuv Event Loop is single threaded. The thread pool is
used for file I/O operations.
http://blog.libtorrent.org/2012/10/asynchronous-disk-io/
Kqueue is a mechanism for registering and responding to process, signal, timer, and file descriptor events in the kernel.
To see how libuv utilizes kqueue you can look in src/unix/darwin.c
What uses the thread pool?
- files
- dns.lookup() and getaddrinfo Uses the system resolver library and makes blocking socket calls.
For macosx the event loop source can be found in src/unix/core.c
uv__update_time(loop);
uv__run_timers(loop);
uv__run_pending(loop); //pending handlers are retuned
uv__run_idle(loop); // uv_idle are run at this stage
uv__run_prepare(loop); // uv_prepare are run at this stage
uv__io_poll(loop, timeout);
uv__run_check(loop);
uv__run_closing_handles(loop);
This program is a simple server that will print out what the clients sends it.
$ ./server
And in a different terminal:
$ telnet localhost 7777
uv_checks are performed after polling for I/O.
idle.cc shows and example of using uv_idle.
$ lldb idle
(lldb) br s -f idle.c -l 14
The following are just notes taken while stepping through tick.cc.
struct uv_loop_s {
/* User data - use this for whatever. */
void* data;
/* Loop reference counting. */
unsigned int active_handles;
void* handle_queue[2];
void* active_reqs[2];
/* Internal flag to signal loop stop. */
unsigned int stop_flag;
UV_LOOP_PRIVATE_FIELDS
};
UV_LOOP_PRIVATE_FIELDS can be found in include/uv-unix.h.
27 uv_loop_t* loop = uv_default_loop();
(lldb) bt
* thread #1: tid = 0x429854, 0x0000000100014279 tick`uv__kqueue_init(loop=0x0000000100018800) + 9 at kqueue.c:41, queue = 'com.apple.main-thread', stop reason = step in
* frame #0: 0x0000000100014279 tick`uv__kqueue_init(loop=0x0000000100018800) + 9 at kqueue.c:41 [opt]
frame #1: 0x0000000100012634 tick`uv__platform_loop_init(loop=<unavailable>) + 20 at darwin.c:43 [opt]
frame #2: 0x0000000100009920 tick`uv_loop_init(loop=0x0000000100018800) + 336 at loop.c:62 [opt]
frame #3: 0x0000000100004121 tick`uv_default_loop + 33 at uv-common.c:589 [opt]
frame #4: 0x0000000100001167 tick`main + 23 at tick.cc:27
frame #5: 0x00007fff92f135ad libdyld.dylib`start + 1
Taking a look at kqueue.c:41 we find:
40 int uv__kqueue_init(uv_loop_t* loop) {
-> 41 loop->backend_fd = kqueue();
42 if (loop->backend_fd == -1)
43 return -errno;
44
We can see that libuv is calling kqueue which creates a new queue. For a standalone kqueue example see kqueue-example.
I'm not sure what loop->backend_fd represents yet but this will be set to the file descriptor returned by kqueue. ioctl is later used to close this file descriptor on exec before returning to the caller. More about this below
uv__closeexec:
r = ioctl(fd, set ? FIOCLEX : FIONCLEX);
FIOCLEX = File IOctl CLose On Exec Sets the close-on-exec flag which causes the file descriptor to be closed when the calling process executes a new program. If an exec call is made the current process image will be replaced with a new one but the file descriptors will be left open. The above call will cause the file descriptors to be close in such a case and this is something that is recommended for library implementors.
This will leave us back in loop.c and the line after uv__platform_loop_init(loop)
Next, a call to uv_signal_init_loop->child_watcher is performed.
Libuv uses self-pipes which is the reason for adding this section. The file system is not pollable so instead a thread is used from the thread pool. The blocking call is performed by the thread and when it is completed it will signal the event loop about this using either eventfd (linux) or a self-pipe. There is no way to wait on a thread in a kqueue/epoll loop so when the thread is done, it will write a byte into the pipe (self-pipe) and the kqueue/epoll loop side will wait for this and wakes up when it arrives.
Signals can be sent to any process at any time and each process and have there own signal handlers to handle the various signals. Your application code will get interupted and then resume when the signal handler has run.
Recall that a pipe is one way, one end if for writing and the other for reading. And it handles a stream of bytes. Anything writting to the pipe will be left there until something reads from the pipe (or the pipe will become full and block)
Normally a pipe is shared with subprocesses but a that is not the case with a shared pipe, it is not shared.
The following can be found in uv.h:
#if defined(_WIN32)
# include "uv/win.h"
#else
# include "uv/unix.h"
#endif
And unix.h
can be found in include/uv/unix.h
.