Box's journey from operational fragility to fully automated monitoring
More than 41 million users and 74,000 businesses, including 59% of the Fortune 500, trust Box to manage content in the cloud. We were monitoring this web-scale infrastructure with Nagios and could not keep up with the rapid pace of change inside Box. This is the story of our migration: from wrestling with 350K objects in Nagios, including over 130K checks, to shutting down the last Nagios host roughly a year later.
Senior Infrastructure Site Reliability Engineer, Box, Inc.
Trent Baker is a Senior Infrastructure SRE at Box, Inc., responsible for designing, implementing, and maintaining infrastructure services such as configuration management, authentication, and alerting and monitoring. Trent designed and implemented Box’s next-generation alerting and monitoring system, positioning Box to quickly and easily monitor, debug, and auto-remediate enterprise-wide services deployed to bare-metal, private cloud, and public cloud systems.