- TCP is a widely used protocol for communication between applications. It is a reliable protocol, built on top of IP.
- IP(Internet Protocol) is a connectionless protocol, which means that there is no established connection between the sender and the receiver.
- When a program sends data over IP, the data is broken into packets, each of which is sent individually to the receiver. The receiver then reassembles the packets into the original data.
- Each packet contains : * a header section * a data section
- The header contains information about the packet, such as the source and destination IP addresses, the length of the packet, and a checksum to ensure that the packet is not corrupted.
- The data section contains the actual data being sent.
- TCP was created to address the limitations of IP.
- Primarly TCP offers two guarantees: * The data will be delivered . It does this by asking the receiver to acknowledge all sent packets, and re-transmitting any packets if an acknowledgement isn't received. * The data will be delivered in the order in which it was sent. This is done by numbering each packet, and asking the receiver to acknowledge the packets in order.
- TCP is a connection-oriented protocol, which means that a connection must be established between the sender and the receiver before data can be sent.To do this, one program must act as a server, and the other as a client.
- The server listens for incoming connections, and the client initiates a connection to the server.Both the server and the client can send and receive data(It's a two-way channel).
- A TCP connection is identified using a unique combination of four values: * The source IP address * The destination IP address * The source port * The destination port
The TCP handshake is how a connection is established between a client and a server. It is a three-step process: -Step 1 : SYN : The client sends a packet with the SYN flag set to the server. This tells the server that the client wants to establish a connection. -Step 2 : SYN-ACK : The server responds with a packet that has the SYN and ACK flags set. This tells the client that the server has received the request, and is willing to establish a connection. -Step 3 : ACK : The client sends a packet with the ACK flag set. This tells the server that the client has received the response, and is ready to start sending data.
TCP is the underlying protocol for many networked applications, including web servers, databases, and messaging systems.In this part, we will build a simple TCP server in Rust using std::net
module.
To write a TCP Server, we'll need to be familiar with the following methods:
- TcpListener::bind : This method creates a new TcpListener which will listen for incoming connections on the specified address.
- TcpListener::incoming : This method returns an iterator over the connections received on this listener.
- TcpListener::connect : This method creates a new TcpStream and connects to the specified address.
- TcpStream::read : This method reads data from the stream.
- TcpStream::write_all : This method writes a buffer into the stream.
The TcpListener
struct is used to listen for incoming TCP connections. It is created by calling the bind
method on the TcpListener
type, and it listens for incoming connections on the specified address.
Here are some methods that are available on the TcpListener
struct:
impl TcpListener {
// accept waits for and returns the next connection to the listener
pub fn accept(&self) -> Result<(TcpStream, SocketAddr)>
// incoming returns an iterator over the connections being received on this listener
pub fn incoming(&self) -> Incoming<TcpStream>
// local_addr returns the local socket address of the listener
pub fn local_addr(&self) -> Result<SocketAddr>
}
Once you've created a listener, you can use TcpListener::incoming() to get an iterator over the incoming connections.
This method returns an iterator that yields connections as they are accepted, allowing you to handle each new connection in a loop.
use std::net::{TcpListener, TcpStream};
use std::io::{Read, Write};
fn main() {
let listener = TcpListener::bind("localhost:7878").unwrap();
for stream in listener.incoming() {
match stream {
Ok(stream) => {
println!("New connection: {}", stream.peer_addr().unwrap());
}
Err(e) => {
println!("Error: {}", e);
}
}
}
}
The iterator returned by the incoming
method of the TcpListener
struct yields a new TcpStream
for each incoming connection. The TcpStream
struct is used to read and write data to and from the connection.
Here are some methods that are available on the TcpStream
struct:
impl TcpStream {
// read reads bytes from the stream
pub fn read(&mut self, buf: &mut [u8]) -> Result<usize>
// write writes bytes to the stream and returns the number of bytes written.
// It's often easier to use write_all instead of this method.
pub fn write(&mut self, buf: &[u8]) -> Result<usize>
// write_all writes all the bytes in buf to the stream
pub fn write_all(&mut self, buf: &[u8]) -> Result<()>
}
To read data from a connection, you'll need to pass in a mutable byte slice to TcpStream::read. The data received will be stored in this byte slice. TcpStream::read returns a Result indicating the number of bytes read:
let mut buf = [0; 1024];
let n = stream.read(&mut buf)?;
println!("received {} bytes", n);
println!("data: {:?}", &buf[..n]);
To write data to a connection, you'll need to pass in a byte slice to TcpStream::write_all. It returns a Result<()> indicating whether the write was successful:
let buf = b"hello world";
stream.write_all(buf)?;
println!("wrote to stream");
Let's put it all together and build a simple TCP server in Rust.
use std::io::{Read, Write};
use std::net::{TcpListener, TcpStream};
fn main() {
// Creates a TCP server listening on localhost:8080
let listener = TcpListener::bind("localhost:8080").expect("Could not bind");
for stream in listener.incoming() {
match stream {
Ok(stream) => {
handle_client(stream);
}
Err(e) => {
eprintln!("Failed: {}", e);
}
}
}
}
fn handle_client(mut stream: TcpStream) {
let mut buf = [0; 512]; // 512 byte buffer
loop {
let bytes_read = stream.read(&mut buf).expect("Failed to read from client");
if bytes_read == 0 {
return;
}
stream.write_all(&buf[0..bytes_read]).expect("Failed to write to client");
}
}
There are some limitations to this server:
- It can only handle one client at a time.
- It reads a fixed-size buffer from the client, and writes the same buffer back to the client.
- It doesn't handle errors gracefully.For example, if a client disconnects abruptly, the server will panic.
RESP is the protocol used by Redis. It is a simple protocol that is easy to implement and parse. It is also human-readable, which makes it easy to debug.it supports several data types, including strings, integers, arrays, and errors.
We can categorize RESP data types as either simple, bulk, or aggregate.
Simple strings are used to represent text data. They are prefixed with a '+' character, and are terminated with a CRLF (Carriage Return Line Feed) sequence.
For example, the string "OK" is represented as:
+OK\r\n
Simple errors are used to represent error messages. They are prefixed with a '-' character, and are terminated with a CRLF sequence.
For example, the error message "ERR operation not permitted" is represented as:
-ERR operation not permitted\r\n
Integers are used to represent whole numbers. They are prefixed with a ':' character, and are terminated with a CRLF sequence.
For example, the number 1000 is represented as:
:1000\r\n
Bulk strings are used to represent binary data. They are prefixed with a '$' character, followed by the length of the string in bytes, and are terminated with a CRLF sequence.
For example, the string "foobar" is represented as:
$6\r\nfoobar\r\n
Arrays are used to represent a collection of RESP data types. They are prefixed with a '*' character, followed by the number of elements in the array, and are terminated with a CRLF sequence.
For example, the array ["foo", "bar", "baz"] is represented as:
*3\r\n$3\r\nfoo\r\n$3\r\nbar\r\n$3\r\nbaz\r\n
Null bulk strings are used to represent a null value. They are represented as:
$-1\r\n
Null arrays are used to represent a null array. They are represented as:
*-1\r\n
The difference between Null Bulk Strings
and Null Arrays
is that Null Bulk Strings
are used to represent a null value, while Null Arrays
are used to represent a null array. For example, if a command returns a null value, it will be represented as a Null Bulk String
. If a command returns a null array, it will be represented as a Null Array
.
Arrays can contain null elements. For example, the array ["foo", null, "baz"] is represented as:
*3\r\n$3\r\nfoo\r\n$-1\r\n$3\r\nbaz\r\n
Nulls' encoding is the underscore (_) character, followed by the CRLF terminator (\r\n). Here's Null's raw RESP encoding:
$_\r\n
Boolean's encoding is the #
character, followed by the value t
for true or f
for false, and the CRLF terminator (\r\n). Here's Boolean's raw RESP encoding:
#t\r\n
#f\r\n
The Double RESP type encodes a double-precision floating point value. Doubles are encoded as follows:
,3.14\r\n
This type can encode integer values outside the range of signed 64-bit integers.
Big numbers use the following encoding:
([+|-]<number>\r\n
For example, the number 3492890328409238509324850943850943825024385 is encoded as:
(3492890328409238509324850943850943825024385\r\n
There is many other types of RESP data types, but I think these are enough as a start. You can find the full list of RESP data types in the official documentation.
New RESP connections should begin the session by calling HELLO
command. The client sends the HELLO
command to the server, and the server responds with the supported version of the RESP protocol.
We can use the RESP serialization format to write redis client library. We can further specify how the interaction between the client and the server works:
- A client sends the Redis server an array consisting of only bulk strings.
- A Redis server replies to clients, sending any valid RESP data type as a reply.
Here's an example of how a client sends a command to the server:
*3\r\n$3\r\nSET\r\n$5\r\nmykey\r\n$7\r\nmyvalue\r\n
Redis follows the leader-follower approach(master-replica). It allows replica Redis to be exact copies of master instances.The replica will automatically reconnect to the master every time the link breaks, and will attempt to be an exact copy of it regardless of what happens to the master.
This system works using three main mechanisms:
- 1 When a master and a replica instances are well-connected, the master keeps the replica updated by sending a stream of commands to the replica to replicate the effects on the dataset happening in the master side due to: client writes, keys expired or evicted, any other action changing the master dataset.
- 2 When the link between the master and the replica breaks, for network issues or because a timeout is sensed in the master or the replica, the replica reconnects and attempts to proceed with a partial resynchronization: it means that it will try to just obtain the part of the stream of commands it missed during the disconnection.
- 3 When a partial resynchronization is not possible, the replica will ask for a full resynchronization. This will involve a more complex process in which the master needs to create a snapshot of all its data, send it to the replica, and then continue sending the stream of commands as the dataset changes.