How to Design and Test Elixir GenServers

Last updated: January 22, 2026

Elixir

Ihor Katkov

Software Engineer

Sofiia Yurkevska

Content Writer

The GenServer is a powerful abstraction for managing stateful processes and harnessing concurrency in Elixir. Yet newcomers and experienced engineers alike often struggle with testing GenServers. In this article, we'll look at how to design and test them properly.

TL;DR

When you need a GenServer: State management over time, concurrency control, background processing, resource management. If you don't need these, use simple structs and functions instead.
GenServer overhead: Requires starting/managing long-running process, inter-process communication for each operation, supervision/recovery handling, memory for state, callback implementation. Only use when necessary.
Three common design flaws: Business logic overload (GenServer should coordinate, not contain complex logic), treating GenServers like OOP objects (simple data management), ignoring single responsibility principle (one task per GenServer).
Better design: Keep GenServers thin—let them coordinate while business logic lives in separate modules (e.g., OrderService.process_order()). Makes testing easier and logic reusable.
Two testing strategies: Isolated callback testing (call callbacks directly for simple state transitions), live GenServer testing (run actual process for complex interactions like timeouts/retries).
Testing with external services: Use explicit contracts defined as behaviours, swap implementations via config (Mailer.Stub for tests), or pass adapters as options to the GenServer for runtime control.
Bottom line: If testing feels difficult, your GenServer is doing too much. Keep it thin, separate business logic, use dependency injection. Well-designed GenServers are straightforward to test.

Do you really need it?

“Simple things should be simple, and complex things should be possible.”

Alan Kay

Co-creator of Smalltalk

GenServer (Generic Server) is one of the core building blocks of Elixir applications: a process that holds state and handles messages one at a time, in the spirit of the actor model. That already hints at where GenServers shine:

State Management

When you need to maintain state between requests (e.g., caching, counters)

Concurrency Control

When you need to serialize access to resources

Background Processing

When you need to handle long-running tasks

Resource Management

When you need to manage connections or limited resources

Before diving into GenServer implementation, always reconsider if you need it. A key question is: "Does my process need to manage state over time or deal with inter-process coordination?" If the answer is no, then there’s likely a better approach available. Many problems can be solved with simple structs and functions, avoiding the overhead and complexity of a full GenServer.

GenServer overhead

defmodule CounterServer do
  use GenServer

  # Client API
  def start_link(initial_count) do
    GenServer.start_link(__MODULE__, initial_count, name: __MODULE__)
  end

  def increment, do: GenServer.call(__MODULE__, :increment)

  # Server callbacks
  @impl true
  def init(initial_count), do: {:ok, initial_count}

  @impl true
  def handle_call(:increment, _from, count) do
    # Reply with the new count and store it as the new state
    {:reply, count + 1, count + 1}
  end
end

This GenServer implementation comes with several forms of overhead:

You need to start and manage a long-running process

Each operation requires inter-process communication

You need to handle process supervision and recovery

Each process maintains its state in memory

You need to implement callbacks and handle the process lifecycle

If none of the above is required, go for a more straightforward implementation:

Simpler alternative

defmodule Counter do
  defstruct count: 0

  def increment(%Counter{count: count} = counter) do
    %Counter{counter | count: count + 1}
  end
end

Why Proper Design Matters:

“If you can’t test it, it’s not a good design.”

Kent Beck

Creator of Extreme Programming

One of the first principles to keep in mind when working with GenServers is the importance of design. With proper design, testing GenServers becomes possible at the very least and usually much easier, while the overall complexity of the application decreases.

Common design flaws:

Business Logic overload

The GenServer should act primarily as a coordinator, passing off complex business logic to external modules. By keeping GenServers thin, you can make testing easier, as the business logic can be tested independently of the process. Let's examine a common anti-pattern where business logic is directly embedded in the GenServer:

Business logic overload example

defmodule OrderProcessor do
  use GenServer

  @impl true
  def handle_call({:process_order, order}, _from, state) do
    # Complex business logic buried in GenServer
    validated_order = validate_order(order)
    total = calculate_total(validated_order)
    updated_inventory = update_inventory(validated_order)
    receipt = generate_receipt(validated_order, total)

    {:reply, receipt, Map.put(state, :inventory, updated_inventory)}
  end

  # Many private functions implementing business logic...
end

This implementation violates key principles of business logic separation. First, it creates testing complexity – business logic is trapped inside process management, each test requires process overhead, it's hard to test business rules in isolation, and difficult to simulate different business scenarios.

Second, it introduces maintainability issues – business rules are mixed with infrastructure concerns, changes to business logic risk affecting process stability, it's hard to adapt as business rules evolve, and difficult to reuse logic across different interfaces.

Third, there's context confusion – there's no clear separation between business and infrastructure layers, business rules become tied to process lifecycle, it's hard to implement new interfaces like API or CLI, and difficult to maintain consistent authorization.

Here's a better approach that separates process management from business logic:

Better approach

defmodule OrderProcessor do
  use GenServer

  @impl true
  def handle_call({:process_order, order}, _from, state) do
    # GenServer only coordinates the process
    {:ok, receipt, updated_inventory} = OrderService.process_order(order, state)

    {:reply, receipt, Map.put(state, :inventory, updated_inventory)}
  end
end
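To make the separation concrete, here is a minimal sketch of what such an OrderService module could look like. The article only names OrderService.process_order/2, so the order fields (sku, quantity, unit_price), the shape of the inventory map, and the error atoms are all assumptions made for illustration.

```elixir
defmodule OrderService do
  @moduledoc """
  Hypothetical pure module: no processes, just data in and data out.
  The order fields and inventory shape are assumptions for this sketch.
  """

  # Returns {:ok, receipt, updated_inventory} or {:error, reason}.
  def process_order(%{sku: sku, quantity: qty, unit_price: price}, %{inventory: inventory})
      when is_integer(qty) and qty > 0 do
    case Map.fetch(inventory, sku) do
      {:ok, stock} when stock >= qty ->
        receipt = %{sku: sku, quantity: qty, total: qty * price}
        {:ok, receipt, Map.put(inventory, sku, stock - qty)}

      {:ok, _insufficient_stock} ->
        {:error, :out_of_stock}

      :error ->
        {:error, :unknown_sku}
    end
  end

  # Anything that does not look like an order is rejected.
  def process_order(_order, _state), do: {:error, :invalid_order}
end
```

Because the function is pure, every business scenario can be tested by calling it with plain maps, without starting a process. Note that a GenServer that pattern-matches only on the {:ok, ...} result would crash on an error tuple, so a real callback would probably want to handle both outcomes.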

Treating GenServers like OOP Objects (simple data management)

This misuse often comes from developers with an object-oriented programming (OOP) background who treat GenServers like objects and try to encapsulate both state and business logic within them. The result is overly complex and often untestable code.

Ignoring SRP

A well-designed GenServer should adhere to the single responsibility principle: it should focus on one task and do it well. This not only makes the GenServer more efficient, but it also simplifies testing. For example, in trading applications like those I work on, where real-time data streams need to be processed quickly, we assign each GenServer its specific task, such as processing orders or managing real-time market data. This helps ensure that no single GenServer becomes a bottleneck.
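As a rough illustration of one task per GenServer, the sketch below splits market data and pending orders into two small processes under one supervisor. The module names MarketData and OrderQueue, their functions, and the message shapes are hypothetical and not taken from any real trading system.

```elixir
defmodule MarketData do
  use GenServer

  # Only responsibility: remember the latest price per symbol.
  def start_link(_opts), do: GenServer.start_link(__MODULE__, %{}, name: __MODULE__)
  def put_price(symbol, price), do: GenServer.cast(__MODULE__, {:put_price, symbol, price})
  def price(symbol), do: GenServer.call(__MODULE__, {:price, symbol})

  @impl true
  def init(prices), do: {:ok, prices}

  @impl true
  def handle_cast({:put_price, symbol, price}, prices) do
    {:noreply, Map.put(prices, symbol, price)}
  end

  @impl true
  def handle_call({:price, symbol}, _from, prices) do
    {:reply, Map.fetch(prices, symbol), prices}
  end
end

defmodule OrderQueue do
  use GenServer

  # Only responsibility: keep pending orders in arrival order.
  def start_link(_opts), do: GenServer.start_link(__MODULE__, [], name: __MODULE__)
  def enqueue(order), do: GenServer.cast(__MODULE__, {:enqueue, order})
  def pending, do: GenServer.call(__MODULE__, :pending)

  @impl true
  def init(orders), do: {:ok, orders}

  @impl true
  def handle_cast({:enqueue, order}, orders), do: {:noreply, [order | orders]}

  @impl true
  def handle_call(:pending, _from, orders), do: {:reply, Enum.reverse(orders), orders}
end

# Each process is supervised independently, so a crash in one does not take the other down.
{:ok, _supervisor} = Supervisor.start_link([MarketData, OrderQueue], strategy: :one_for_one)

MarketData.put_price("ACME", 101.5)
OrderQueue.enqueue(%{symbol: "ACME", quantity: 10})
```

Each module has a narrow client API and a handful of callbacks, which keeps both the mailbox load and the test surface small.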

Lean on isolation and mocking for effective testing

Now that we've covered proper GenServer design – keeping them thin, avoiding business logic overload, and maintaining single responsibility – let's explore how these principles enable straightforward testing. When your GenServers are well-designed, testing becomes natural and follows two main strategies:

Isolated callback testing

When testing GenServers, you should apply a consistent strategy across the different callbacks (handle_call, handle_cast, handle_info, etc.). Calling these callbacks directly, in isolation, lets you focus on specific state transitions without the overhead of running a GenServer. This is particularly useful for simple tests where you need to validate that the correct state is returned for given inputs.
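A minimal sketch of this strategy follows. TickCounter and its messages are hypothetical, chosen only to show one callback of each kind; since callbacks are ordinary public functions, the tests call them without ever starting a process.

```elixir
# Needed only when running this file as a standalone script;
# in a Mix project, test/test_helper.exs already does this.
ExUnit.start()

defmodule TickCounter do
  use GenServer

  @impl true
  def init(initial), do: {:ok, initial}

  @impl true
  def handle_call(:increment, _from, count), do: {:reply, count + 1, count + 1}

  @impl true
  def handle_cast({:add, n}, count), do: {:noreply, count + n}

  @impl true
  def handle_info(:reset, _count), do: {:noreply, 0}
end

defmodule TickCounterCallbackTest do
  use ExUnit.Case, async: true

  test "handle_call/3 replies with the new count and stores it" do
    # The `from` argument is ignored by the callback, so any placeholder works.
    from = {self(), make_ref()}
    assert {:reply, 6, 6} = TickCounter.handle_call(:increment, from, 5)
  end

  test "handle_cast/2 adds to the state without replying" do
    assert {:noreply, 7} = TickCounter.handle_cast({:add, 2}, 5)
  end

  test "handle_info/2 resets the state" do
    assert {:noreply, 0} = TickCounter.handle_info(:reset, 5)
  end
end
```

These tests only verify state transitions. They say nothing about message ordering, timeouts, or supervision, which is where the live strategy comes in.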

Live GenServer testing

For more complex interactions, such as timeouts or retries with external services, it’s often necessary to run the GenServer itself. José Valim laid out the underlying ideas in his article "Mocks and explicit contracts". Let's look at how to apply them:

Functional testing with live GenServer

When you deal with modules/services that you cannot control (or you don't want to control), you can wrap them into facades with explicit contracts and different adapters for different environments (test, dev, prod). Let's take a look at an example.

defmodule Mailer.Adapter do
  # The contract every mailer implementation must fulfil
  @callback send_email(to :: String.t(), subject :: String.t(), body :: String.t()) ::
              :ok | {:error, term()}
end

defmodule Mailer.Stub do
  @behaviour Mailer.Adapter

  @impl true
  def send_email(_to, _subject, _body), do: :ok
end

defmodule YourEmailService do
  @behaviour Mailer.Adapter

  @impl true
  def send_email(_to, _subject, _body) do
    # you actually send an email here using your email service
  end
end

Let's say we have a GenServer which sends an email after processing an order. By having an explicit contract, we can easily swap the implementation for testing purposes.

Case 1, when we don't need to swap the implementation

In that case, we can configure the Stub implementation once for the whole test environment.

# somewhere in your test.exs file
config :my_app, :mailer, Mailer.Stub

defmodule OrderProcessor do
  use GenServer

  # in that case, Stub is baked into the GenServer, allowing us to focus on the business logic
  @mailer Application.compile_env(:my_app, :mailer)

  def handle_call({:process_order, order}, _from, state) do
    # ...
    @mailer.send_email(order.email, "Order Confirmation", "Thank you for your order!")
    # ...
  end
end

Case 2, when we NEED to swap the implementation

In that case, we can pass the implementation as an option to the GenServer so that each test can choose its own adapter.

defmodule OrderProcessor do
  use GenServer

  def start_link(opts) do
    # The adapter is chosen by whoever starts the process
    mailer = Keyword.fetch!(opts, :mailer)
    GenServer.start_link(__MODULE__, %{mailer: mailer}, name: __MODULE__)
  end

  # ...

  def handle_call({:process_order, order}, _from, state) do
    # ...
    case state.mailer.send_email(order.email, "Order Confirmation", "Thank you for your order!") do
      :ok -> # ...
      {:error, reason} -> # ...
    end
    # ...
  end
end

defmodule FailingMailer do
  @behaviour Mailer.Adapter

  def send_email(_to, _subject, _body), do: {:error, "Failed to send email"}
end

# in your test module
describe "handle_call/3" do
  test "handles a mailer failure" do
    # ...
    OrderProcessor.start_link(mailer: FailingMailer)
    # ...
  end
end
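The runtime-injection approach above can be sketched end to end as follows. The reply shapes, the process_order/2 client function, and the optional :name handling are assumptions added to make the example runnable. The process is left unnamed here so the test can run with async: true, and ExUnit's start_supervised!/1 stops the process when the test ends.

```elixir
# Needed only when running this file as a standalone script.
ExUnit.start()

defmodule Mailer.Adapter do
  @callback send_email(to :: String.t(), subject :: String.t(), body :: String.t()) ::
              :ok | {:error, term()}
end

defmodule FailingMailer do
  @behaviour Mailer.Adapter

  @impl true
  def send_email(_to, _subject, _body), do: {:error, "Failed to send email"}
end

defmodule OrderProcessor do
  use GenServer

  def start_link(opts) do
    mailer = Keyword.fetch!(opts, :mailer)
    # Registering a name is optional in this sketch.
    GenServer.start_link(__MODULE__, %{mailer: mailer}, Keyword.take(opts, [:name]))
  end

  # Hypothetical client function wrapping the call.
  def process_order(server, order), do: GenServer.call(server, {:process_order, order})

  @impl true
  def init(state), do: {:ok, state}

  @impl true
  def handle_call({:process_order, order}, _from, state) do
    case state.mailer.send_email(order.email, "Order Confirmation", "Thank you for your order!") do
      :ok -> {:reply, :ok, state}
      {:error, reason} -> {:reply, {:error, {:email_failed, reason}}, state}
    end
  end
end

defmodule OrderProcessorLiveTest do
  use ExUnit.Case, async: true

  test "reports a mailer failure to the caller and keeps running" do
    pid = start_supervised!({OrderProcessor, mailer: FailingMailer})

    assert {:error, {:email_failed, "Failed to send email"}} =
             OrderProcessor.process_order(pid, %{email: "user@example.com"})

    assert Process.alive?(pid)
  end
end
```

Passing the adapter at start time keeps the choice local to each test, so a failing mailer in one test cannot leak into another.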

Conclusion

What have we learned about working with GenServers?

First, not every problem needs a GenServer. Simple structs and functions are often enough – avoid the process management overhead unless you really need that persistent state or coordination between processes.

Second, when you do need a GenServer, keep it thin. Let it coordinate processes while keeping business logic elsewhere. You'll thank yourself later when testing and maintaining the code. Remember that mixing business rules with process management is a recipe for complexity.

Finally, testing well-designed GenServers doesn't have to be hard. Test simple state transitions by calling callbacks directly. Use proper contracts and dependency injection for more complex cases involving external services, either through configuration or at runtime.

The bottom line? If testing feels difficult, your GenServer might be doing too much. Let that guide you toward better design decisions.