Jan. 15, 2021

Infrastructure Reliability Engineer - Senior Staff Engineer -- Remote Opportunity

KOHLS Menomonee Falls, Wisconsin

Our purpose as a company is to inspire and empower families to lead fulfilled lives–to let our customers know that the things that make their lives better are within their reach. We’re down to earth, but we’re up for the challenge. We are Transforming Infrastructure We are in the process of transforming infrastructure at Kohl's by building an enterprise platform that will empower Kohl's development teams to rapidly build high-quality, modern technology solutions that help differentiate us in the marketplace. We design and build reliability into our products and believe that a methodical, steady, relentless forward momentum drives consistent results. Responsible for: \- Demonstrating operational excellence by leading major automation, toil reduction initiatives, and simplifying ecosystems \- Guiding product teams to build resilient and observable architectures \- Managing a small group of reliability engineers while partnering with cross-functional teams to provide leadership on reliability initiatives \- Setting the vision and driving cultural transformation within the team \- Leading engineering efforts to build products helping infrastructure product teams to improve their own reliability \- Advising product teams within your domain on capacity planning, chaos engineering, and DR/HA \- Working with product owners within our infrastructure domain to define reliability best practices, SLOs, and error budgetsRequired: \- Bachelor's Degree or equivalent in MIS, Computer Science or related field \- Must be willing to have fun and not take themselves too seriously \- 6+ years of experience in software development \- Have strong programming skills in Python, Ansible, and Groovy \- Ability to see opportunity from the perspective of the customers we support \- Proven ability to manage multiple competing priorities \- Advanced in-depth knowledge of application design patterns, event-driven architecture, database schemas, and testing strategies \- Able to influence cross team and willing to tailor your approach to the audience you have \- Demonstrated experience with large scale application troubleshooting and performance tuning \- Demonstrated experience working with at least one major cloud platforms (GCP, AWS, or Azure) \- Deep experience in one of more Observability platforms - Prometheus, Grafana, Dynatrace, or Big Panda \- Deep experience in at least one PaaS or container management platform - Openshift, Kubernetes or equivalent \- Deep experience with one or more configuration management systems like Chef, Ansible, Puppet, or GitOps Preferred: \- Advanced in-depth knowledge and experience with continuous integration, continuous deployment, and test driven development \- Advanced Deep understanding of systems architecture, UNIX internals, networking topologies, multi-cluster applications, multi-tenant platforms, and systems/network security

Create an account to see the full posting, access our search engine, and more.

Looking For Similar Jobs?