Challenge:
We deployed an AWS ECS cluster using Terraform, with multiple application components communicating through an AWS Network Load Balancer (NLB). The NLB was chosen because the application used a custom TCP protocol (the Akka Cluster discovery protocol), which an AWS ALB does not support.
Initially everything worked fine with an external-facing NLB, but the client asked that access to the application be restricted to their internal network. This required switching the NLB type to internal. After the switch, external access was blocked as expected, but some application components within ECS lost connectivity to each other.
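A minimal Terraform sketch of that change, under assumed names and values (app_nlb, akka_tcp, port 2552 as a typical Akka remoting port, and var.private_subnet_ids); the external-to-internal switch is a single attribute, and the target group the listener forwards to appears further below:

```hcl
resource "aws_lb" "app_nlb" {
  name               = "app-nlb"
  load_balancer_type = "network"
  internal           = true   # was false for the external-facing NLB
  subnets            = var.private_subnet_ids
}

# Plain TCP listener: an ALB could not forward the custom Akka protocol.
resource "aws_lb_listener" "akka_tcp" {
  load_balancer_arn = aws_lb.app_nlb.arn
  protocol          = "TCP"
  port              = 2552   # assumed Akka remoting port
  default_action {
    type             = "forward"
    target_group_arn = aws_lb_target_group.akka.arn
  }
}
```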

Investigation and Root Cause:
- Issue: Components running on the same ECS worker (EC2 container instance) could not communicate via the internal AWS NLB.
- Observation: Components running on different ECS workers could still communicate.
- Root Cause: AWS NLB does not support loopback (hairpin) traffic when targets are registered by instance ID: a container on a worker cannot reach another container on the same worker through the NLB, because the connection is routed back to the very instance it originated from. A sketch of the offending target-group configuration follows this list.
- Confirmation: The issue was verified through detailed troubleshooting and discussions with AWS Support.
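For context, this is roughly what the original target registration looked like under bridge networking, sketched with assumed names (akka, port 2552, var.vpc_id):

```hcl
# Bridge-mode setup (simplified): the registered targets are the EC2
# container instances themselves. With instance targets, the NLB preserves
# the client's source IP, so a connection that hairpins back to the same
# instance arrives with that instance's own IP as the source and the TCP
# handshake never completes.
resource "aws_lb_target_group" "akka" {
  name        = "akka-tg"
  protocol    = "TCP"
  port        = 2552
  vpc_id      = var.vpc_id
  target_type = "instance"   # the root of the loopback problem
}
```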
Solution:
- Switched the ECS task network mode from the default bridge mode to awsvpc.
- With awsvpc, each task receives its own elastic network interface and private IP address, so tasks register with the NLB target group by IP rather than by instance.
- This allowed all components to communicate reliably via the internal NLB, regardless of whether they ran on the same or different ECS workers. A Terraform sketch of the fix follows this list.
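A sketch of the fix under the same assumed names (plus var.app_image, var.cluster_arn, and var.app_sg_id): the task definition moves to awsvpc mode, and the target group registers the resulting task IPs directly. Note that changing target_type forces Terraform to replace the target group:

```hcl
# Same target group as above, switched to IP targets: each task's ENI IP is
# registered individually, so the NLB no longer hairpins to an instance.
resource "aws_lb_target_group" "akka" {
  name        = "akka-tg"
  protocol    = "TCP"
  port        = 2552
  vpc_id      = var.vpc_id
  target_type = "ip"   # register task IPs, not EC2 instances
}

resource "aws_ecs_task_definition" "app" {
  family       = "app"
  network_mode = "awsvpc"   # each task gets its own ENI and private IP
  container_definitions = jsonencode([{
    name         = "app"
    image        = var.app_image
    memory       = 512
    portMappings = [{ containerPort = 2552, protocol = "tcp" }]
  }])
}

resource "aws_ecs_service" "app" {
  name            = "app"
  cluster         = var.cluster_arn
  task_definition = aws_ecs_task_definition.app.arn
  desired_count   = 3

  # Required when network_mode = "awsvpc": the tasks' ENIs live in these
  # subnets and are guarded by their own security group.
  network_configuration {
    subnets         = var.private_subnet_ids
    security_groups = [var.app_sg_id]
  }

  load_balancer {
    target_group_arn = aws_lb_target_group.akka.arn
    container_name   = "app"
    container_port   = 2552
  }
}
```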
Final Outcome:
- A fully functional, internal-only ECS cluster behind an NLB.
- All components connected successfully, preserving high availability while keeping the application restricted to the client's network.
- Had the application used HTTP/HTTPS instead of Akka's custom TCP protocol, an AWS ALB would have been the ideal choice since ALBs support security groups; here, the NLB was required for the custom TCP protocol.
This project demonstrated the importance of understanding AWS networking intricacies, especially when using an NLB with internal-only traffic in ECS. 🚀