Back to jobs

Site Reliability Engineer

DuckDuckGo | Remote


Apply


We're on the lookout for a new member in Operations!

The DuckDuckGo Operations team primarily performs two critical functions:

1. Keep the search engine available online and return results quickly.

2. Streamline development environments used by our growing staff to collaborate on software updates and test their changes.

While these two objectives may seem dissimilar they are integrally connected.

By reducing friction in the development workflow we empower the developers to continue to make significant improvements to the product. This translates to an enriched search experience for our end users. By keeping the search engine available and stable we help millions of people get the results they need when they turn to DuckDuckGo for answers. The team uses a variety of tools to help achieve these goals, and we're always striving for improvement. By joining a small team at this stage you will have the opportunity to help shape our growing infrastructure!

As a member of the team your colleagues will rely on you for help. We all work together to ensure site reliability, and the developers turn to us when issues appear in their environment. This also means that we engage in active monitoring and response -- we share the burden of on call responsibilities and are always available to help each other in a crisis. We all enjoy problem solving and hope you do too!

Typical Responsibilities:

  • Perform deep dives into reliability issues; partner with software and ops engineers to produce and roll out fixes

  • Troubleshoot issues across the entire stack

  • Identify opportunities to improve automation for the company; scope and create automation for deployment, management, and visibility of our services

  • Help determine the future technical direction of our deployment

  • Collaborate with engineers to improve development workflow and tools
  • Technical Requirements:

  • Expert level understanding of Linux servers

  • Adept knowledge of shell scripting and a higher level language (Perl preferred)

  • Experience with automation tools (Chef preferred)

  • Experience working on an on-call ops team

  • Ability to prioritize tasks and work independently as part of a remote team

  • Must be adaptable and able to focus on the simplest, most efficient and reliable solutions
  • Some Desired Skills:

  • Experience deploying to public clouds (preferably AWS)

  • Experience or familiarity with the following projects: PostgreSQL, Couchbase/memcached, and Apache Solr
  • Please see https://duck.co/help/company/hiring for general details about DuckDuckGo hiring and how to apply for positions.


    Learn More

    Back