<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Cluster Management</title><link>https://blog.shuaizhang.cc/en-us/tags/cluster-management/</link><description>Posts gathered under this term.</description><generator>Hugo</generator><language>en-US</language><lastBuildDate>Wed, 29 Aug 2018 23:07:13 +0000</lastBuildDate><atom:link href="https://blog.shuaizhang.cc/en-us/tags/cluster-management/index.xml" rel="self" type="application/rss+xml"/><item><title>Large-Scale Cluster Management and Operations Automation</title><link>https://blog.shuaizhang.cc/en-us/posts/cluster-management-and-devops/</link><pubDate>Wed, 29 Aug 2018 00:00:00 +0000</pubDate><guid>https://blog.shuaizhang.cc/en-us/posts/cluster-management-and-devops/</guid><description>This article discusses core issues in large-scale cluster operations automation, including automatic fault detection, automatic remediation, and safety.</description></item><item><title>Paper Notes: [Operating Systems Review 2007] Autopilot: Automated Data Center Management</title><link>https://blog.shuaizhang.cc/en-us/posts/microsoft-autopilot/</link><pubDate>Sun, 15 Jul 2018 00:00:00 +0000</pubDate><guid>https://blog.shuaizhang.cc/en-us/posts/microsoft-autopilot/</guid><description>This article outlines the design goals of Microsoft's Autopilot cluster management system, as well as its mechanisms for machine lifecycle management, application deployment, and automated operations.</description></item></channel></rss>