The core problem stemmed from processing large files in the build context, which led to increased memory usage and, at times, network latency between regions.
To ensure long-term stability and prevent recurrence, our team implemented a series of comprehensive structural improvements focused on reducing memory usage, improving stream handling, and modernizing our codebase:
stream.promises.pipeline
, Transform
, Passthrough
) to replace older, archived dependencies, which reduces the likelihood of memory leaks and edge cases.We realize the recent incidents can be very disruptive and understand your frustration. Each of these incidents as well as their cause and steps to resolution have been documented on our status page. You can view the recent ones here on the history page with links to the details about each event: https://status.balena.io/history.
As you may know, the recent frequency of incidents is historically unusual for us. Our DevOps team worked overtime to resolve the issues as they arose. Our general process is not only to resolve, but also involve the broader team to make careful structural changes as preventative measures. For instance, we are reducing our reliance on hosting partners that may suffer from regional network network issues.
Thank you for your patience.