Browse Definitions :
Definition

Simian Army

The Simian Army is a collection of open source cloud testing tools created by the online video streaming company, Netflix. The tools allow engineers to test the reliability, security, resiliency and recoverability of the cloud services that Netflix runs on Amazon Web Services (AWS) infrastructure.

Netflix engineers started creating the autonomous software agents, which are called monkeys, soon after moving to the cloud with AWS. Each monkey is designed to help make Netflix's service less fragile and better able to support continuous service, with minimal degradation, when parts of the cloud experience random failures.

Members of the Simian Army include:

  • Chaos Monkey - randomly shuts down virtual machines (VMs) to ensure that small disruptions will not affect the overall service.
  • Latency Monkey - simulates a degradation of service and checks to make sure that upstream services react appropriately.
  • Conformity Monkey - detects instances that aren’t coded to best-practices and shuts them down, giving the service owner the opportunity to re-launch them properly.
  • Security Monkey - searches out security weaknesses, and ends the offending instances. It also ensures that SSL and DRM certificates are not expired or close to expiration.
  • Doctor Monkey - performs health checks on each instance and monitors other external signs of process health such as CPU and memory usage.
  • Janitor Monkey - searches for unused resources and discards them.

Each of these tools helps make cloud service less fragile and better able to support continuous service, with minimal degradation, when parts of the cloud have a problem. Potential problems can be detected and addressed. Furthermore, induced failures provide knowledge that can help prevent future failures and also provide guidance for dealing with any that do occur.

The word Netflix engineers continue to conceptualize and develop new monkeys and invite the community to do so as well.

 

 

This was last updated in August 2013

Continue Reading About Simian Army

SearchCompliance

  • information governance

    Information governance is a holistic approach to managing corporate information by implementing processes, roles, controls and ...

  • enterprise document management (EDM)

    Enterprise document management (EDM) is a strategy for overseeing an organization's paper and electronic documents so they can be...

  • risk assessment

    Risk assessment is the identification of hazards that could negatively impact an organization's ability to conduct business.

SearchSecurity

  • cracker

    A cracker is someone who breaks into someone else's computer system, often on a network; bypasses passwords or licenses in ...

  • Malware quiz: Test your knowledge of types and terms

    Malware trends are constantly evolving, but older techniques are still often used in cyber attacks today. Test your knowledge of ...

  • digital signature

    A digital signature is a mathematical technique used to validate the authenticity and integrity of a message, software or digital...

SearchHealthIT

SearchDisasterRecovery

  • risk mitigation

    Risk mitigation is a strategy to prepare for and lessen the effects of threats faced by a business.

  • call tree

    A call tree is a layered hierarchical communication model that is used to notify specific individuals of an event and coordinate ...

  • Disaster Recovery as a Service (DRaaS)

    Disaster recovery as a service (DRaaS) is the replication and hosting of physical or virtual servers by a third party to provide ...

SearchStorage

  • storage virtualization

    Storage virtualization is the pooling of physical storage from multiple storage devices into what appears to be a single storage ...

  • erasure coding

    Erasure coding (EC) is a method of data protection in which data is broken into fragments, expanded and encoded with redundant ...

  • continuous data protection

    Continuous data protection (CDP), also known as continuous backup, is a backup and recovery storage system in which all the data ...

Close