Streamline Occasion-driven Microservices With Kafka and Python


For a lot of vital utility features, together with streaming and e-commerce, monolithic structure is not adequate. With present calls for for real-time occasion knowledge and cloud service utilization, many trendy functions, corresponding to Netflix and Lyft, have shifted to an event-driven microservices method. Separated microservices can function independently of each other and improve a code base’s adaptability and scalability.

However what’s an event-driven microservices structure, and why do you have to use it? We’ll study the foundational points and create an entire blueprint for an event-driven microservices challenge utilizing Python and Apache Kafka.

Utilizing Occasion-driven Microservices

Occasion-driven microservices mix two trendy structure patterns: microservices architectures and event-driven architectures. Although microservices can pair with request-driven REST architectures, event-driven architectures have gotten more and more related with the rise of massive knowledge and cloud platform environments.

What Is a Microservices Structure?

A microservices structure is a software program growth method that organizes an utility’s processes as loosely coupled providers. It’s a sort of service-oriented structure (SOA).

In a standard monolithic construction, all utility processes are inherently interconnected; if one half fails, the system goes down. Microservices architectures as an alternative group utility processes into separate providers interacting with light-weight protocols, offering improved modularity and higher app maintainability and resiliency.

Microservices architecture (with UI individually connected to separate microservices) versus monolithic architecture (with logic and UI connected).
Microservices Structure vs. Monolithic Structure

Although monolithic functions could also be less complicated to develop, debug, take a look at, and deploy, most enterprise-level functions flip to microservices as their commonplace, which permits builders to personal elements independently. Profitable microservices must be stored so simple as attainable and talk utilizing messages (occasions) which are produced and despatched to an occasion stream or consumed from an occasion stream. JSON, Apache Avro, and Google Protocol Buffers are frequent selections for knowledge serialization.

What Is an Occasion-driven Structure?

An event-driven structure is a design sample that buildings software program in order that occasions drive the conduct of an utility. Occasions are significant knowledge generated by actors (i.e., human customers, exterior functions, or different providers).

Our instance challenge options this structure; at its core is an event-streaming platform that manages communication in two methods:

  • Receiving messages from actors that write them (often referred to as publishers or producers)
  • Sending messages to different actors that learn them (often referred to as subscribers or customers)

In additional technical phrases, our event-streaming platform is software program that acts because the communication layer between providers and permits them to alternate messages. It will probably implement quite a lot of messaging patterns, corresponding to publish/subscribe or point-to-point messaging, in addition to message queues.

A producer sending a message to an event-streaming platform, which sends the message to one of three consumers.
Occasion-driven Structure

Utilizing an event-driven structure with an event-streaming platform and microservices presents a wealth of advantages:

  • Asynchronous communications: The power to independently multitask permits providers to react to occasions at any time when they’re prepared as an alternative of ready on a earlier process to complete earlier than beginning the subsequent one. Asynchronous communications facilitate real-time knowledge processing and make functions extra reactive and maintainable.
  • Full decoupling and adaptability: The separation of producer and shopper elements implies that providers solely have to work together with the event-streaming platform and the info format they’ll produce or eat. Providers can comply with the single accountability precept and scale independently. They’ll even be applied by separate growth groups utilizing distinctive know-how stacks.
  • Reliability and scalability: The asynchronous, decoupled nature of event-driven architectures additional amplifies app reliability and scalability (that are already benefits of microservices structure design).

With event-driven architectures, it’s simple to create providers that react to any system occasion. You can too create semi-automatic pipelines that embody some guide actions. (For instance, a pipeline for automated consumer payouts may embody a guide safety verify triggered by unusually massive payout values earlier than transferring funds.)

Selecting the Mission Tech Stack

We are going to create our challenge utilizing Python and Apache Kafka paired with Confluent Cloud. Python is a strong, dependable commonplace for a lot of forms of software program tasks; it boasts a big group and plentiful libraries. It’s a sensible choice for creating microservices as a result of its frameworks are suited to REST and event-driven functions (e.g., Flask and Django). Microservices written in Python are additionally generally used with Apache Kafka.

Apache Kafka is a widely known event-streaming platform that makes use of a publish/subscribe messaging sample. It’s a frequent selection for event-driven architectures attributable to its intensive ecosystem, scalability (the results of its fault-tolerance skills), storage system, and stream processing skills.

Lastly, we are going to use Confluent as our cloud platform to effectively handle Kafka and supply out-of-the-box infrastructure. AWS MSK is one other glorious possibility when you’re utilizing AWS infrastructure, however Confluent is less complicated to arrange as Kafka is the core a part of its system and it presents a free tier.

Implementing the Mission Blueprint

We’ll arrange our Kafka microservices instance in Confluent Cloud, create a easy message producer, then set up and enhance it to optimize scalability. By the top of this tutorial, we can have a functioning message producer that efficiently sends knowledge to our cloud cluster.

Kafka Setup

We’ll first create a Kafka cluster. Kafka clusters host Kafka servers that facilitate communication. Producers and customers interface with the servers utilizing Kafka subjects (classes storing information).

  1. Join Confluent Cloud. When you create an account, the welcome web page seems with choices for creating a brand new Kafka cluster. Choose the Primary configuration.
  2. Select a cloud supplier and area. It is best to optimize your selections for one of the best cloud ping outcomes out of your location. One possibility is to decide on AWS and carry out a cloud ping take a look at (click on HTTP Ping) to determine one of the best area. (For the scope of our tutorial, we are going to go away the “Single zone” possibility chosen within the “Availability” subject.)
  3. The subsequent display asks for a cost setup, which we will skip since we’re on a free tier. After that, we are going to enter our cluster identify (e.g., “MyFirstKafkaCluster”), affirm our settings, and choose Launch cluster.
The Confluent “Create cluster” screen with various configuration choices for the “MyFirstKafkaCluster” cluster and a “Launch cluster” button.
Kafka Cluster Configuration

With a working cluster, we’re able to create our first subject. Within the left-hand menu bar, navigate to Matters and click on Create subject. Add a subject identify (e.g., “MyFirstKafkaTopic”) and proceed with the default configurations (together with setting six partitions).

Earlier than creating our first message, we should arrange our shopper. We are able to simply Configure a shopper from our newly created subject overview (alternatively, within the left-hand menu bar, navigate to Shoppers). We’ll use Python as our language after which click on Create Kafka cluster API key.

The Confluent Clients screen showing step 2 (client code configuration) with the Kafka cluster API key setup and the configuration code snippet.
Kafka Cluster API Key Setup

At this level, our event-streaming platform is lastly able to obtain messages from our producer.

Easy Message Producer

Our producer generates occasions and sends them to Kafka. Let’s write some code to create a easy message producer. I like to recommend organising a digital atmosphere for our challenge since we will probably be putting in a number of packages in the environment.

First, we are going to add the environment variables from the API configuration from Confluent Cloud. To do that in our digital atmosphere, we’ll add export SETTING=worth for every setting under to the top of our activate file (alternatively, you possibly can add SETTING=worth to your .env file):

export KAFKA_BOOTSTRAP_SERVERS=<bootstrap.servers>
export KAFKA_SECURITY_PROTOCOL=<safety.protocol>
export KAFKA_SASL_MECHANISMS=<sasl.mechanisms>
export KAFKA_SASL_USERNAME=<sasl.username>
export KAFKA_SASL_PASSWORD=<sasl.password>

Ensure to interchange every entry together with your Confluent Cloud values (for instance, <sasl.mechanisms> must be PLAIN), together with your API key and secret because the username and password. Run supply env/bin/activate, then printenv. Our new settings ought to seem, confirming that our variables have been accurately up to date.

We will probably be utilizing two Python packages:

We’ll run the command pip set up confluent-kafka python-dotenv to put in these. There are various different packages for Kafka in Python which may be helpful as you broaden your challenge.

Lastly, we’ll create our primary producer utilizing our Kafka settings. Add a simple_producer.py file:

# simple_producer.py
import os

from confluent_kafka import KafkaException, Producer
from dotenv import load_dotenv

def principal():
    settings = {
        'bootstrap.servers': os.getenv('KAFKA_BOOTSTRAP_SERVERS'),
        'safety.protocol': os.getenv('KAFKA_SECURITY_PROTOCOL'),
        'sasl.mechanisms': os.getenv('KAFKA_SASL_MECHANISMS'),
        'sasl.username': os.getenv('KAFKA_SASL_USERNAME'),
        'sasl.password': os.getenv('KAFKA_SASL_PASSWORD'),
    }

    producer = Producer(settings)
    producer.produce(
        subject='MyFirstKafkaTopic',
                      key=None,
                      worth='MyFirstValue-111',
    )
    producer.flush()  # Anticipate the affirmation that the message was acquired

if __name__ == '__main__':
    load_dotenv()
    principal()

With this easy code we create our producer and ship it a easy take a look at message. To check the consequence, run python3 simple_producer.py:

Confluent’s Cluster Overview dashboard, with one spike appearing in the Production (bytes/sec) and Storage graphs, and no data shown for Consumption.
First Take a look at Message Throughput and Storage

Checking our Kafka cluster’s Cluster Overview > Dashboard, we are going to see a brand new knowledge level on our Manufacturing graph for the message despatched.

Customized Message Producer

Our producer is up and operating. Let’s reorganize our code to make our challenge extra modular and OOP-friendly. It will make it simpler so as to add providers and scale our challenge sooner or later. We’ll cut up our code into 4 recordsdata:

  • kafka_settings.py: Holds our Kafka configurations.
  • kafka_producer.py: Incorporates a customized produce() methodology and error dealing with.
  • kafka_producer_message.py: Handles totally different enter knowledge varieties.
  • advanced_producer.py: Runs our remaining app utilizing our customized courses.

First, our KafkaSettings class will encapsulate our Apache Kafka settings, so we will simply entry these from our different recordsdata with out repeating code:

# kafka_settings.py
import os

class KafkaSettings:
    def __init__(self):
                      self.conf = {
            'bootstrap.servers': os.getenv('KAFKA_BOOTSTRAP_SERVERS'),
            'safety.protocol': os.getenv('KAFKA_SECURITY_PROTOCOL'),
            'sasl.mechanisms': os.getenv('KAFKA_SASL_MECHANISMS'),
            'sasl.username': os.getenv('KAFKA_SASL_USERNAME'),
            'sasl.password': os.getenv('KAFKA_SASL_PASSWORD'),
        }

Subsequent, our KafkaProducer permits us to customise our produce() methodology with help for numerous errors (e.g., an error when the message dimension is just too massive), and likewise mechanically flushes messages as soon as produced:

# kafka_producer.py
from confluent_kafka import KafkaError, KafkaException, Producer

from kafka_producer_message import ProducerMessage
from kafka_settings import KafkaSettings

class KafkaProducer:
    def __init__(self, settings: KafkaSettings):
        self._producer = Producer(settings.conf)

    def produce(self, message: ProducerMessage):
        attempt:
            self._producer.produce(message.subject, key=message.key, worth=message.worth)
            self._producer.flush()
        besides KafkaException as exc:
            if exc.args[0].code() == KafkaError.MSG_SIZE_TOO_LARGE:
                cross  # Deal with the error right here
            else:
                increase exc

In our instance’s try-except block, we skip over the message whether it is too massive for the Kafka cluster to eat. Nevertheless, you need to replace your code in manufacturing to deal with this error appropriately. Discuss with the confluent-kafka documentation for a whole record of error codes.

Now, our ProducerMessage class handles various kinds of enter knowledge and accurately serializes them. We’ll add performance for dictionaries, Unicode strings, and byte strings:

# kafka_producer_message.py
import json

class ProducerMessage:
    def __init__(self, subject: str, worth, key=None) -> None:
        self.subject = f'{subject}'
        self.key = key
        self.worth = self.convert_value_to_bytes(worth)

    @classmethod
    def convert_value_to_bytes(cls, worth):
        if isinstance(worth, dict):
            return cls.from_json(worth)

        if isinstance(worth, str):
            return cls.from_string(worth)

        if isinstance(worth, bytes):
            return cls.from_bytes(worth)

        increase ValueError(f'Improper message worth sort: {sort(worth)}')

    @classmethod
    def from_json(cls, worth):
        return json.dumps(worth, indent=None, sort_keys=True, default=str, ensure_ascii=False)

    @classmethod
    def from_string(cls, worth):
        return worth.encode('utf-8')

    @classmethod
    def from_bytes(cls, worth):
        return worth

Lastly, we will construct our app utilizing our newly created courses in advanced_producer.py:

# advanced_producer.py
from dotenv import load_dotenv

from kafka_producer import KafkaProducer
from kafka_producer_message import ProducerMessage
from kafka_settings import KafkaSettings

def principal():
    settings = KafkaSettings()
    producer = KafkaProducer(settings)
    message = ProducerMessage(
        subject='MyFirstKafkaTopic',
        worth={"worth": "MyFirstKafkaValue"},
        key=None,
    )
    producer.produce(message)

if __name__ == '__main__':
    load_dotenv()
    principal()

We now have a neat abstraction above the confluent-kafka library. Our customized producer possesses the identical performance as our easy producer with added scalability and adaptability, able to adapt to numerous wants. We may even change the underlying library totally if we wished to, which units our challenge up for achievement and long-term maintainability.

Confluent’s Cluster Overview dashboard: Production shows two spikes, Storage shows two steps (with horizontal lines), and Consumption shows no data.
Second Take a look at Message Throughput and Storage

After operating python3 advanced_producer.py, we see but once more that knowledge has been despatched to our cluster within the Cluster Overview > Dashboard panel of Confluent Cloud. Having despatched one message with the easy producer, and a second with our customized producer, we now see two spikes in manufacturing throughput and a rise in total storage used.

Trying Forward: From Producers to Shoppers

An event-driven microservices structure will improve your challenge and enhance its scalability, flexibility, reliability, and asynchronous communications. This tutorial has given you a glimpse of those advantages in motion. With our enterprise-scale producer up and operating, sending messages efficiently to our Kafka dealer, the subsequent steps could be to create a shopper to learn these messages from different providers and add Docker to our utility.

The editorial staff of the Toptal Engineering Weblog extends its gratitude to E. Deniz Toktay for reviewing the code samples and different technical content material offered on this article.

Leave a Reply