All jobs
MEMXDevOps
Member of Staff, Systems Reliability Engineering (International Remote Hire)
Remote (US)Posted today
MEMX is seeking a Systems Reliability Operator to support exchange platforms during US overnight shifts, focusing on incident response, system support, and operational improvements.
Location: Remote (US)
Responsibilities
- Provide support for MEMX exchange platforms including on-call, respond to incidents, and support triaging issues.
- Help isolate and resolve unplanned system outages.
- Work with cross-functional teams to support platform availability, including market operations, systems, networking, and development teams.
- Help improve operational processes such as deployments and upgrades.
- Document actions to facilitate automation.
- Debug issues across different services and interaction points.
- Enhance monitoring and alerting based on symptoms.
- Run nightly processes essential to exchange operations, automating where possible.
Requirements
- Good understanding of Linux and Linux Shell.
- Mid to advanced Linux administration and scripting skills.
- Proficiency in Bash scripting.
- Proficiency in a configuration management tool (Ansible, Chef, Puppet).
- Experience with monitoring tools.
- Familiarity with incident tracking/ticketing systems and escalation procedures.
- 2+ years of experience in an operation support role with incident response.
- Highly curious, detail-oriented, and problem-solving mindset.
- Strong collaboration skills and process improvement mindset.
- Ability to deliver quickly and iterate fast.
- Trading and/or exchange experience is a plus but not required.
Benefits
- Work From Home
- Training & Development
- Wellness Resources
- Additional benefits based on region
Similar remote jobs
yesterday