Rogo

Site Reliability Engineer (SRE)

Remote
Full-time
New York City
4 months ago

Job Overview

Actively Hiring

Job Type

100% Remote

Work from anywhere

Employment Type

Full-time

Flexible schedule

Location Preference

New York City

Preferred time zones

Job Categories

DevOps/SRE, Software Engineering

Job Description

We're building AI thought partners to make people smarter and more creative, accelerating the creation and sharing of knowledge in financial services. We're unabashedly ambitious, and we're dead set on building the biggest Financial AI company in the world. Our team is lean, smart, and endlessly curious.

What You Will Own

  • Infrastructure Management: Design, deploy, and maintain cloud infrastructure on AWS and/or Azure, ensuring high availability and resilience.

  • Monitoring and Performance: Implement and manage monitoring solutions using Datadog to proactively identify and address system issues.

  • Container Orchestration: Manage Kubernetes clusters, utilizing Helm for package management and deployment automation.

  • Automation and Scripting: Develop and maintain Infrastructure as Code (IaC) using tools like Terraform, and create automation scripts in Bash or Python to streamline operations.

  • Collaboration: Work closely with development and operations teams to propagate DevOps culture, share best practices, and ensure seamless integration and deployment processes.

  • Incident Response: Troubleshoot and resolve complex cross-platform issues related to OS, networking, and databases in a cloud-based environment.

  • Documentation: Maintain comprehensive documentation of system configurations, procedures, and troubleshooting guides.

What You Will Need

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.

  • Experience

    • 3-5 years of hands-on experience with AWS and/or Azure cloud platforms, including services like EC2, S3, VPC, and Lambda.

    • 2-3 years of experience managing Kubernetes clusters in production environments.

    • 2-3 years of experience with Helm for Kubernetes package management.

    • 2-3 years of experience with Datadog or similar monitoring tools.

    • 3-5 years of experience with Linux system administration and shell scripting.

    • 2-3 years of experience with Infrastructure as Code (IaC) tools like Terraform.

  • Skills

    • Proficiency in scripting languages such as Bash and Python.

    • Strong understanding of networking fundamentals, including TCP/IP, DNS, and load balancing.

    • Experience with CI/CD pipelines and tools like Jenkins, GitLab CI, or GitHub Actions.

    • Experience with cloud-native security best practices and compliance frameworks.

    • Excellent problem-solving skills and the ability to navigate complex challenges effectively.

    • Strong communication and collaboration skills.

Bonus

  • Experience with MLOps monitoring and observability.

  • Experience with PostgreSQL, Elasticsearch, and vector databases such as Qdrant.

  • Experience with monitoring and security tools such as Datadog, AWS GuardDuty, CloudWatch, and CloudTrail.

  • Certifications in AWS, Azure, or Kubernetes.

  • Experience with other cloud platforms like Google Cloud Platform (GCP).

  • Experience with distributed tracing and observability tools.

Who You Are

  • You thrive in fast-paced environments. You are high-intensity, you care a lot about what you do, and you're ecstatic to work at a start-up.

  • You are ambitious. You have fun solving problems that others think are impossible.

  • You are curious. You find joy in learning about AI, technology, and finance.

  • You are an owner. You are autonomous, self-directed, and comfortable working with ambiguity.

  • You are collaborative, organized, and thoughtful.

Why Join Rogo?

  • Exceptional traction: strong product-market fit (PMF) with the world's largest investment banks, hedge funds, and private equity firms.

  • World-class team: we take talent density seriously. We like working with incredibly smart, driven people.

  • Velocity: we work fast, which means you learn a lot and constantly take on new challenges.

  • Frontier technology: we're developing cutting-edge AI systems, pushing the boundaries of published research, redefining what's possible, and inventing the future.

  • Cutting-edge product: our platform is state-of-the-art and crazily powerful. We're creating tools that make people smarter, reinventing how you discover, create, and share knowledge.
