Reliability Engineer

This job is no longer active. View similar jobs.

POST DATE 9/3/2016
END DATE 12/19/2016

Next Step Systems New York, NY

New York, NY
AJE Ref #
Job Classification
Full Time
Job Type
Company Ref #



New York NY Jobs, Reliability Engineer, Linux, UNIX, Scripting, Open Source, New York Recruiters, Information Technology Jobs, IT Jobs, New York Recruiting


Will relocate the right candidate. Will Sponsor Visa's. Will only consider candidates from top tier computer science universities and/or individuals with a stellar GPA. Bachelor's and/or Master's degree from a top computer science program with a GPA of 3.5 or higher. PhD preferred. Top computer science program preferred (Carnegie Mellon University, Massachusetts Institute of Technology/MIT, Stanford University, University of California-Berkeley, Cornell University, University of Illinois-Urbana-Champaign, Princeton University, University of Washington, University of Texas-Austin, Georgia Institute of Technology, California Institute of Technology, University of Wisconsin-Madison, University of Michigan-Ann Arbor, etc.

As a member of this versatile group of full stack engineers, you will be on the front line for maintaining and expanding the capabilities of many and varied systems. The team exists in the space between traditional systems administration and development, and seeks to merge the capabilities from both disciplines.


-Act as a conduit between infrastructure and development teams, being sympathetic to the concerns and priorities of both.

-Primary operational support for multiple large distributed software applications.

-Improve all aspects of software reliability, including better monitoring, alerting and documentation.

-Engage with software engineering teams on support issues and improvements to tools, processes, and software.

-Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.


-A bachelor's degree in computer science or another highly technical, scientific discipline.

-Ability to program (structured and OO) with one or more 'scripting' languages (such as Python, Ruby, and/or JavaScript). Experience with Java, C, C++ is a definite advantage.

-In-depth knowledge and experience in at least one of: host based networking, Linux/UNIX administration, systems programming, distributed systems, databases, and a desire to learn more.

-The ability to quickly leverage off the shelf and open source systems and utilities to rapidly provision production systems in a variety of domains, especially for multi-tenant use.

-A proven track record of automation and an algorithmic approach to solving problems.

-A proactive approach to spotting problems, areas for improvement, performance bottlenecks, etc.

-An understanding of the operational concerns in a demanding environment; ideally, but not necessarily, finance.

Additional Skills Preferred:

-Familiar with relational database concepts and have the ability to construct at least moderately complex SQL queries.

-Experience with authentication and encryption technologies like SSL, Kerberos and GSSAPI.

-Networking experience, analyzing packet dumps, multicast routing on hosts, packet filtering.

-OS/kernel experience such as familiarity with OS tunables, log analysis.

-Experience with automated configuration management tools.