Jr. Site Reliability Engineer

Remote

Job Description / Skills Required

At Aera Technology, we are helping the largest enterprises in the world transform how they make and execute decisions with Decision Intelligence. Aera understands how your business works, makes real-time recommendations, predicts outcomes, and takes action autonomously. Our​ ​platform​ delivers the business agility required to respond to today’s ever-changing environment.
 
The SRE team supports the development, enhancement, and maintenance of the cloud infrastructure that our applications and services run on. Aera's SRE group manages the architecture and engineering of all environments from production and acceptance to sandbox and sales. The team develops infrastructure as code, monitoring solutions for the health, performance, and reliability of the Aera stack, and in general, “keeps the lights on” by providing tier III support for our 24/7 Platform Operations team. The SRE team is also on the front line of adopting and developing state-of-the-art infrastructure to continuously evolve the platform. 
 
The primary responsibilities for this role will be to use your background as an operations generalist to work closely with architecture, engineering and development leads from the early stages of design through identifying and resolving production issues that relate to infrastructure. You will gain experience and hands on knowledge of deploying, running and and managing Kubernetes as well as experience with monitoring and metrics collection solutions, Linux configuration management and deployment and providing tier III support for our 24×7 Platform Operations team. 
 
This is an excellent opportunity for an early in career, recent graduate to join our team and build your skills and career with a leader in the Decision Intelligence space.
 

Responsibilities

      • Developing code, deploying and providing tier III support for Aera's production infrastructure
      • Responding to production incidents and determining how we can prevent them in the future
      • Triaging and troubleshooting production issues to ensure reliability and performance
      • Identifying and automating manual processes
      • Continuously evolving our monitoring tools and platform
      • Promoting and applying best practices for building scalable and reliable services across engineering
      • Developing and maintaining technical documentation/diagrams, runbooks, and procedures
      • Tier III support for a 24×7 online environment as part of an on-call rotation providing response to production incidents and participating in root cause analysis and problem management
 

About You

    • Bachelor of Science in Computer Science or related field is required
    • 1+ years of SRE/DevOps/infrastructure experience
    • 1+ years of experience deploying, operating and debugging server software on Linux at scale
    • Experience using any flavor of Kubernetes is highly desirable
    • Curious personality and driven to identify the root cause of infrastructure issues and helping to resolve them
    • Exposure to automating and running large scale production Java services in AWS or other cloud providers, ideally using containerization technologies such as Docker
    • Experience with the use, maintenance or configuration of monitoring, metrics and logging infrastructure (ELK, Promethius/Grafana, Nagios, etc.)
    • Enthusiasm for systems automation, configuration management and orchestration tools such as Ansible, Terraform and Crossplane and interest in automating and streamlining tasks in an SRE/Operations engineering context using scripting languages such as Python, Go, Ruby, etc…
At Aera, we're on a mission to solve the biggest, most intractable challenges in the world of enterprise software. We envision the rise of the Self-Driving Enterprise: a more autonomously functioning business with a central operating system that connects and orchestrates business operations. Our Cognitive Operating System is increasingly used by the world's largest companies to fundamentally transform their organizations and how work is done.
If you share our passion for building the next generation of enterprise software, and deploying it for the most sophisticated customers in the world, you’ve met your match. Headquartered in Mountain View, California, we're growing fast, with teams in Mountain View and San Francisco (California), Bucharest and Cluj-Napoca (Romania), Paris (France), Munich (Germany), London (UK), Pune and Bangalore (India), Sydney (Australia) and Singapore.  So join us, and let’s build the future of work together!