Job Details

Observability Capacity SRE Engineer (East Coast, FULLY REMOTE)

  2025-06-18     Splunk     all cities,AK  
Description:

About Us

Splunk Cloud Operations is where critical thinking meets real-world impact. Our teams operate at the heart of the Splunk Platform, ensuring stability, performance, and continuous improvement across a massive cloud ecosystem. We work at speed, across boundaries, and with purpose, solving sophisticated, large-scale problems that directly affect customers worldwide.

The Opportunity

This isn't your average support or SRE role. As an Observability Capacity Engineer, you'll play a strategic role in ensuring Splunk's Observability products scale effectively and serve customers reliably. You'll operate at the intersection of systems engineering, platform operations, and tooling, making data-driven decisions to optimize how capacity is provisioned and how services run across a distributed architecture.

This is a high-impact role for an engineer who enjoys digging into infrastructure puzzles, building smarter systems, and acting as a connective force between Engineering, Support, and Product.

What You'll Do

  • Triage and resolve inbound quota and capacity requests for Observability customers.
  • Fine-tune backend configurations to match customer traffic patterns and platform load.
  • Maintain stability and scalability of a shared, distributed infrastructure supporting hundreds of tenants.
  • Monitor platform usage and proactively ensure capacity is available for customer growth.
  • Collaborate with Engineering teams to identify and resolve critical performance or availability issues.
  • Define requirements and advocate for tooling improvements that reduce manual effort and speed up delivery.
  • Use your engineering mindset to drive continuous process and system optimization.
  • Support the broader Fulfillment Operations team by building and maintaining business-critical Splunk dashboards.

What You Bring

  • 3-5 years of experience in software engineering, DevOps, SRE, or platform operations roles.
  • Working knowledge of cloud-native infrastructure, distributed microservice architectures, and CI/CD pipelines.
  • Strong debugging and systems thinking skills: you can connect symptoms to root causes across layers.
  • Proficiency with the command line; hands-on experience with Jira or similar systems.
  • Familiarity with observability tools (e.g., metrics, logging, tracing platforms).
  • Comfortable balancing tactical execution with strategic thinking: you enjoy both shipping and shaping.
  • Strong collaboration skills and the ability to partner effectively with engineering, support, and product teams.
  • Hands-on experience building and optimizing Splunk dashboards.
  • Bonus: experience with entitlement systems or Salesforce.

Why Join Us

  • You'll work on systems at scale with tangible customer impact.
  • You'll gain deep exposure to observability tooling, platform architecture, and operational strategy.
  • You'll influence how we build, automate, and evolve capacity workflows across the company.
  • You'll be part of a team that values autonomy, critical thinking, and cross-functional collaboration.

Splunk, a Cisco company, is an Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis.

  • Seniority level: Not Applicable
  • Employment type: Full-time
  • Job function: Engineering and Information Technology
  • Industries: Software Development, IT Services and IT Consulting, and Technology, Information and Internet
