Intelligent Process Automation in Banking: Think Resilience

Intelligent process automation in banking brings in a scalable, seamless & secure solution that redefines banking across all processes, experiences & customers. Are you ready to scale up?

Software Architecture and Design InfoQ Trends Report - April 2021

In 2021, the innovators are finding ways to provide both the run-time and design-time observability benefits almost automatically. An overview of how the InfoQ editorial team sees the Software Architecture and Design topic evolving in 2021, with a focus on what architects are designing for today. Software Architecture and Design InfoQ Trends Report—April 2021.

How to Improve Resilience in Kubernetes with Advanced Traffic Management

How to Improve Resilience in Kubernetes with advanced traffic management. Improve the resilience of Kubernetes apps with the traffic control and splitting methods discussed in this blog –Kubernetes Apps and their resilience is a mature traffic management strategy.

A Short Note on Cascading Failures in MicroServices Architecture

In this lesson, we are going to learn how to a short note on Cascading Failures in MicroServices Architecture. There are many such advantages that the newer microservices architecture brings to us but there is one thing and one such cascading failure that we experienced a few weeks ago.

Estimating Building Heights Using LiDAR Data

Abhineet Gupta, Director of Resilience Research at One Concern. I intend to provide a more up-to-date set of tools and processes to use LiDAR data to extract building elevations, using San Francisco as an example.

Growing Resilience: Serving Half a Billion Users Monthly at Condé Nast

Crystal Hirschorn outlines how Condé Nast practices Chaos Engineering, where this fits within the already established testing and verification ecosystem, and what emergent practices and tools are on the horizon. Last but not least, she covers how to build up an organization’s true superpower: Human Resilience.

Navigating Complex Software Projects and Leading in Uncertain Times: InfoQ Live.

InfoQ Live brings together world-class practitioners, such as John Willis, Senior Director in Red Hat's Global Transformation Office, and Sarah Wells, Technical Director for Operations and Reliability @FT, to share their valuable insights and practical advice on software engineering leadership.

Implement Health Checks for Kubernetes in Your Application

In this article, we are going to implement health checks and corresponding probes so that Kubernetes is able to define whether a containerized application is healthy and can act accordingly.

Delivering Technology Through Software Engineering Leadership: Upcoming InfoQ Live Event

InfoQ Live, the interactive virtual event designed for the modern software practitioner, returns on Sept 23rd with a new topic focus: delivering technology by software engineering leadership and by empowering teams. Join world-class practitioners and deep-dive into best practices for leading tech projects, analyzing team data dynamics, and leading teams in uncertain times.

Nora Jones on Resilience Engineering, Mental Models, and Learning

In this podcast, Nora Jones, Co-Founder and CEO at Jeli and co-author of O’Reilly’s “Chaos Engineering: System Resiliency in Practice”, sat down with InfoQ podcast co-host Daniel Bryant. Topics discussed included: chaos engineering and resilience engineering, planning and running effective chaos experiments, and learning from incidents.

The Importance of Reliability Engineering

We’ll look at three big benefits of investing in reliability and explain how you can get started on your journey to reliability excellence.

Reduce Engineering Problems With a Resiliency Mindset - DZone DevOps

Reduce Engineering Problems With a Resiliency Mindset: To reach your optimal state of resilience, there are some crucial SRE best practices you should adopt to strengthen your processes.

Applying Observability to Ship Faster

To get fast feedback, ship work often, as soon as it is ready, and use automated systems in Live to test the changes. Monitoring can be used to verify if things are good, and to raise an alarm if not. Shipping fast in this way can result in having fewer tests and can make you more resilient to problems.