Why Redis Pub/Sub is not SQS, and Vice Versa
Redis has a pub/sub feature whereby there are publishers and subscribers, and publishers can fanout messages to subscribers. SQS [ amazon’s simple queue service] has message senders and receivers. They can both be useful, but in practice they produce different results.
How Redis Pub/Sub Works: An Example
Every subscriber to a channel in redis will receive the message sent by the publisher. I can demonstrate this with a simple example. Let’s say you start up a local redis instance with docker, something like:
docker run --rm -d -p 127.0.0.1:6379:6379/tcp redis
814a1bcb5abcb17dcb1b657fd3e8563ada0a2b1d9df5e4f1b09c229139f8964a
Now you connect to redis from two different terminals and subscribe to the same channel:
# first terminal
$ redis-cli
127.0.0.1:6379> SUBSCRIBE a-channel
Reading messages... (press Ctrl-C to quit)
1) "subscribe"
2) "a-channel"
3) (integer) 1
# second terminal
$ redis-cli
127.0.0.1:6379> SUBSCRIBE a-channel
Reading messages... (press Ctrl-C to quit)
1) "subscribe"
2) "a-channel"
3) (integer) 1
In a third terminal, you publish to a-channel, which is what we are subscribing to in the previously opened two terminals:
$ redis-cli
127.0.0.1:6379> PUBLISH a-channel "everyone sees this"
(integer) 2
The returned number is 2, which means two subscribers received this response. Here is the terminal output from both of them:
# first terminal
1) "message"
2) "a-channel"
3) "everyone sees this"
# second terminal
1) "message"
2) "a-channel"
3) "everyone sees this"
This means that, if you’re running multiple instances of the same application, then every instance of the application will get your message.
How SQS Works: An Example
Let’s say you start up a local SQS mock using localstack, with a docker-compose.yaml like so:
version: '2.1'
services:
localstack:
container_name: "${LOCALSTACK_DOCKER_NAME-localstack_main}"
image: localstack/localstack
ports:
- "4566-4599:4566-4599"
- "${PORT_WEB_UI-8080}:${PORT_WEB_UI-8080}"
environment:
- SERVICES=${SERVICES- }
- DEBUG=${DEBUG- }
- DATA_DIR=${DATA_DIR- }
- PORT_WEB_UI=${PORT_WEB_UI- }
- LAMBDA_EXECUTOR=${LAMBDA_EXECUTOR- }
- KINESIS_ERROR_PROBABILITY=${KINESIS_ERROR_PROBABILITY- }
- DOCKER_HOST=unix:///var/run/docker.sock
- HOST_TMP_FOLDER=${TMPDIR}
volumes:
- "${TMPDIR:-/tmp/localstack}:/tmp/localstack"
- "/var/run/docker.sock:/var/run/docker.sock"
You then bring up your localstack container with:
docker-compose up -d
Now let’s repeat what is effectively the same experiment. First, if you’re following along, you need to make sure you have credentials or your aws cli tool will complain:
export AWS_SECRET_ACCESS_KEY="FAKE"
export AWS_ACCESS_KEY_ID="FAKE"
export AWS_DEFAULT_REGION=us-east-1
Note that you’ll also unfortunately need to specify those environment variables for each of the three terminals we are about to open. In the first and second terminal, run this command to pull a message off of the SQS queue as soon as one becomes available:
# first and second terminal
aws --endpoint-url http://localhost:4566 sqs receive-message --queue-url http://localhost:4566/000000000000/test-queue --wait-time-seconds 20
While those two commands are waiting for a response [note: you have 20 seconds until they just give up, so you might need to rerun those before this final one to really get the point of this article], in your third terminal run this:
aws --endpoint-url http://localhost:4566 sqs send-message --message-body "hello" --queue-url http://localhost:4566/0000000000/test-queue
What should happen under low throughput, from “real” SQS, is that one of the clients trying to get a message actually get it. The other client is still waiting. SQS guarantees at-least-once delivery, but under normal circumstances and most of the time you will get about-once delivery. On my computer, I saw this:
# first terminal
$ aws --endpoint-url http://localhost:4566 sqs receive-message --queue-url http://localhost:4566/000000000000/test-queue --wait-time-seconds 20
{
"Messages": [
{
"MessageId": "ca46e741-6522-b27b-e454-76f514a9a851",
"ReceiptHandle": "mkdoaefeokewmrbqxmhnovuokoslnvwhkcxnkzafvxuwzbrkritnqweummmquejiajlcjkpnzhezwolaodklfrutbjeutggqfoefaypapvzihvhbuvfohxjzjxdtkmhclnxqdmiqwkrykghrmywomdlitbnsxjgkhojdoeynpxjpgoujeyxkhovsi",
"MD5OfBody": "5d41402abc4b2a76b9719d911017c592",
"Body": "hello",
"Attributes": {
"SenderId": "AIDAIT2UOQQY3AUEKVGXU",
"SentTimestamp": "1619914864555",
"ApproximateReceiveCount": "1",
"ApproximateFirstReceiveTimestamp": "1619914864566"
}
}
]
}
# second terminal
$ aws --endpoint-url http://localhost:4566 sqs receive-message --queue-url http://localhost:4566/000000000000/test-queue --wait-time-seconds 20
$ # timed out waiting for a response
What this means in practice
So, which one should I use? Well, it depends on whether you want all instances of your application to get the message or not. If your just sending a message as a job [to do a piece of work, like update the database], then you want SQS [both because of its about-once-delivery characteristics, but also because of its durability guarantees, which redis does not offer because it’s in memory]. If you need to send a message to every instance in your application, for example to invalidate an in memory cache, then you probably want to use redis pub/sub.