Prometheus Query Syntax Jobs in Usa

844 positions found — Page 4

Software Engineer–DevSecOps (Senior or Lead)
🏢 Boeing
Salary not disclosed
TUKWILA, WA 4 days ago

Job Description

At Boeing, we innovate and collaborate to make the world a better place. We’re committed to fostering an environment for every teammate that’s welcoming, respectful and inclusive, with great opportunity for professional growth. Find your future with us.

The Boeing Company is currently seeking a Software Engineer–DevSecOps (Senior or Lead) to support our E7A Software Development Environments team located in Tukwila, Washington.

This team is responsible for building and maintaining multiple secure development environments that enable continuous integration and secure software development for the US Air force E-7A Rapid Prototype Program. This position will focus on developing and maintaining RKE2 and EKS backed software development tools supporting Maven Gradle and Python development pipelines in both AWS and onsite disconnected environments.

Position Responsibilities:

  • Lead the design, development, testing and verification of E-7A software engineering tools

  • Develop and enhance our DevSecOps operations across multiple environments

  • Collaborate with cross-functional teams to integrate and implement solutions for software development

  • Provide end user support and troubleshooting to software development teams using program defined pipeline configurations and development environments

  • Monitor and maintain software development environments and tools

  • Participate in peer reviews and provide technical leadership to junior engineers

  • Utilize analytical problem-solving skills to resolve issues and improve processes and tools

Our team is currently hiring for a broad range of experience levels including: Senior and Lead Engineers.

This position is expected to be 100% onsite. The selected candidate will be required to work onsite at one of the listed location options.

Security Clearance and Export Control Requirements:

This position requires an active Secret U.S. Security Clearance. (A U.S. Security Clearance that has been active in the past 24 months is considered active.)

Basic Qualifications (Required Skills/ Experience):

  • 3+ years of experience administering Linux systems

  • 3+ years of experience administering Kubernetes EKS or RKE2 deployments

  • 3+ years of experience with AWS cloud development and containerization

  • Programming or scripting experience in Java or C++ and Python

Preferred Qualifications (Desired Skills/Experience):

  • Bachelor of Science degree from an accredited course of study in engineering, engineering technology (includes manufacturing engineering technology), chemistry, physics, mathematics, data science, or computer science

  • Ability to set up, and manage a toolchain that uses GitLab, GitLab-CI, Gradle, Maven, Nexus/Artifactory, SonarQube, Prometheus, FluentBit, CloudWatch, Helm, Terraform, Ansible, OpenShift, Docker and more

  • AWS Certification is preferred

  • Containerization knowledge including Docker, Kubernetes, Helm, Rancher, Istio, Big Bang

  • Understanding of designing and implementing full stack/Microservice infrastructure

  • Understanding of secure software development methodologies and Security First mindset

  • Understanding of cloud architecture and design methodologies

  • Strong Working knowledge of the CI/CD process including debugging, test, and integration of software tools

  • Knowledge of general software development and testing tools, including compilers, linkers, debuggers, and requirements management tools

  • Experience with Confluence, Jira

Drug Free Workplace:

Boeing is a Drug Free Workplace (DFW) where post offer applicants and employees are subject to testing for marijuana, cocaine, opioids, amphetamines, PCP, and alcohol when criteria is met as outlined in our policies.

Union:

This is a union-represented position.

CodeVue Coding Challenge: 

To be considered for this position you will be required to complete a technical assessment as part of the selection process. Failure to complete the assessment will remove you from consideration.

Pay & Benefits:

At Boeing, we strive to deliver a Total Rewards package that will attract, engage and retain the top talent.  Elements of the Total Rewards package include competitive base pay and variable compensation opportunities. 

The Boeing Company also provides eligible employees with an opportunity to enroll in a variety of benefit programs, generally including health insurance, flexible spending accounts, health savings accounts, retirement savings plans, life and disability insurance programs, and a number of programs that provide for both paid and unpaid time away from work. 

The specific programs and options available to any given employee may vary depending on eligibility factors such as geographic location, date of hire, and the applicability of collective bargaining agreements.

Pay is based upon candidate experience and qualifications, as well as market and business considerations. 

Summary pay range for Senior Level: $130,900 - $177,100.

Summary pay range for Lead Level: $161,500 - $218,500.


Applications for this position will be accepted until Mar. 30, 2026


Export Control Requirements:

This position must meet U.S. export control compliance requirements. To meet U.S. export control compliance requirements, a “U.S. Person” as defined by 22 C.F.R. §120.62 is required. “U.S. Person” includes U.S. Citizen, U.S. National, lawful permanent resident, refugee, or asylee.

Export Control Details:

US based job, US Person required

Relocation

This position offers relocation based on candidate eligibility.

Security Clearance

This position requires an active U.S. Secret Security Clearance (U.S. Citizenship Required). (A U.S. Security Clearance that has been active in the past 24 months is considered active)

Visa Sponsorship

Employer will not sponsor applicants for employment visa status.

Shift

This position is for 1st shift


Equal Opportunity Employer:

Boeing is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, age, physical or mental disability, genetic factors, military/veteran status or other characteristics protected by law.

permanent
Cloud Infrastructure Automation Engineer (AUSTIN)
Salary not disclosed
AUSTIN, Texas 4 days ago
**This position supports hybrid work schedule depending on organization needs.**

RESPONSIBILITIES:

- Architect, design, and maintain scalable CI/CD pipelines using Azure/AWS DevSecOps.

- Build and optimize Docker-based microservices, images, and deployment pipelines.

- Lead deployments across Docker Swarm, Kubernetes/EKS, and multi-location environments.

- Develop infrastructure automation using Ansible, bash scripting, Terraform and Git-based workflow.

- Manage release pipelines using container registries, artifact feeds, template pipelines, and multi-stage workflows.

- Design multi-environment strategies for dev, QA, staging, and production deployment.

- Implement cloud-native services with AWS & Azure cloud platforms.

- Implement basic security practices, including IAM roles, secrets management, and access controls.

- Develop secure, modular, reusable build and release systems.

- Work closely with full-stack engineering teams (Angular, Java, Python , backend APIs, database engineers).

- Mentor junior DevOps engineers and lead DevOps roadmap decisions.

KNOWLEDGE REQUIREMENTS:

DevOps Expertise:

Azure DevOps pipelines, YAML templating, CI/CD strategy, Git branching models.

Containerization & Orchestration:

Docker images, Docker Compose, Docker Swarm, multi-node/multi-location deployments.

Cloud Technologies:

Azure deployments & infrastructure, AWS (IAM, Lambda, S3, CloudWatch).

Programming / Scripting Languages:

Python, Bash, Linux/Unix administration, awk, shell automation, groovy.

Infrastructure Automation:

Ansible playbooks, tasks/roles, inventory design, configuration management.

Distributed Deployment Architecture:

Multi-site replication, node selection by IP, dynamic service routing.

Database Stack Experience:

PostgreSQL, MySQL, MariaDB operations & migrations.

Observability & Logging:

CloudWatch monitoring, log collection, Prometheus, Grafana, reporting & metrics.

Version Control & Build Systems:

Azure Devops, Git, Git submodules, artifact storage, registry solutions, Secrets Management.

Nice to have AI knowledge/experience and willingness to learn.

EDUCATION & EXPERIENCE REQUIREMENTS

- BS degree in Electrical/Computer Engineering, Computer Science or related field. MS preferred.
- 7+ years experience in a software devops/development/test capacity with enterprise server, storage or networking products.
temporary
CI/CD Pipeline Architect (AUSTIN)
🏢 JABIL CIRCUIT, INC
Salary not disclosed
AUSTIN, Texas 4 days ago
**This position supports hybrid work schedule depending on organization needs.**

RESPONSIBILITIES:

- Architect, design, and maintain scalable CI/CD pipelines using Azure/AWS DevSecOps.

- Build and optimize Docker-based microservices, images, and deployment pipelines.

- Lead deployments across Docker Swarm, Kubernetes/EKS, and multi-location environments.

- Develop infrastructure automation using Ansible, bash scripting, Terraform and Git-based workflow.

- Manage release pipelines using container registries, artifact feeds, template pipelines, and multi-stage workflows.

- Design multi-environment strategies for dev, QA, staging, and production deployment.

- Implement cloud-native services with AWS & Azure cloud platforms.

- Implement basic security practices, including IAM roles, secrets management, and access controls.

- Develop secure, modular, reusable build and release systems.

- Work closely with full-stack engineering teams (Angular, Java, Python , backend APIs, database engineers).

- Mentor junior DevOps engineers and lead DevOps roadmap decisions.

KNOWLEDGE REQUIREMENTS:

DevOps Expertise:

Azure DevOps pipelines, YAML templating, CI/CD strategy, Git branching models.

Containerization & Orchestration:

Docker images, Docker Compose, Docker Swarm, multi-node/multi-location deployments.

Cloud Technologies:

Azure deployments & infrastructure, AWS (IAM, Lambda, S3, CloudWatch).

Programming / Scripting Languages:

Python, Bash, Linux/Unix administration, awk, shell automation, groovy.

Infrastructure Automation:

Ansible playbooks, tasks/roles, inventory design, configuration management.

Distributed Deployment Architecture:

Multi-site replication, node selection by IP, dynamic service routing.

Database Stack Experience:

PostgreSQL, MySQL, MariaDB operations & migrations.

Observability & Logging:

CloudWatch monitoring, log collection, Prometheus, Grafana, reporting & metrics.

Version Control & Build Systems:

Azure Devops, Git, Git submodules, artifact storage, registry solutions, Secrets Management.

Nice to have AI knowledge/experience and willingness to learn.

EDUCATION & EXPERIENCE REQUIREMENTS

- BS degree in Electrical/Computer Engineering, Computer Science or related field. MS preferred.
- 7+ years experience in a software devops/development/test capacity with enterprise server, storage or networking products.
temporary
Devops CI/CD Engineer (AUSTIN)
🏢 JABIL CIRCUIT, INC
Salary not disclosed
AUSTIN, Texas 4 days ago
**This position supports hybrid work schedule depending on organization needs.**

RESPONSIBILITIES:

- Architect, design, and maintain scalable CI/CD pipelines using Azure/AWS DevSecOps.

- Build and optimize Docker-based microservices, images, and deployment pipelines.

- Lead deployments across Docker Swarm, Kubernetes/EKS, and multi-location environments.

- Develop infrastructure automation using Ansible, bash scripting, Terraform and Git-based workflow.

- Manage release pipelines using container registries, artifact feeds, template pipelines, and multi-stage workflows.

- Design multi-environment strategies for dev, QA, staging, and production deployment.

- Implement cloud-native services with AWS & Azure cloud platforms.

- Implement basic security practices, including IAM roles, secrets management, and access controls.

- Develop secure, modular, reusable build and release systems.

- Work closely with full-stack engineering teams (Angular, Java, Python , backend APIs, database engineers).

- Mentor junior DevOps engineers and lead DevOps roadmap decisions.

KNOWLEDGE REQUIREMENTS:

DevOps Expertise:

Azure DevOps pipelines, YAML templating, CI/CD strategy, Git branching models.

Containerization & Orchestration:

Docker images, Docker Compose, Docker Swarm, multi-node/multi-location deployments.

Cloud Technologies:

Azure deployments & infrastructure, AWS (IAM, Lambda, S3, CloudWatch).

Programming / Scripting Languages:

Python, Bash, Linux/Unix administration, awk, shell automation, groovy.

Infrastructure Automation:

Ansible playbooks, tasks/roles, inventory design, configuration management.

Distributed Deployment Architecture:

Multi-site replication, node selection by IP, dynamic service routing.

Database Stack Experience:

PostgreSQL, MySQL, MariaDB operations & migrations.

Observability & Logging:

CloudWatch monitoring, log collection, Prometheus, Grafana, reporting & metrics.

Version Control & Build Systems:

Azure Devops, Git, Git submodules, artifact storage, registry solutions, Secrets Management.

Nice to have AI knowledge/experience and willingness to learn.

EDUCATION & EXPERIENCE REQUIREMENTS

- BS degree in Electrical/Computer Engineering, Computer Science or related field. MS preferred.
- 7+ years experience in a software devops/development/test capacity with enterprise server, storage or networking products.
temporary
Software Engineer-DevSecOps (Senior or Lead)
🏢 Boeing
$130,900
Tukwila, Washington 3 days ago
Job Description
At Boeing, we innovate and collaborate to make the world a better place. We're committed to fostering an environment for every teammate that's welcoming, respectful and inclusive, with great opportunity for professional growth. Find your future with us.
The Boeing Company is currently seeking a Software Engineer-DevSecOps (Senior or Lead) to support our E7A Software Development Environments team located in Tukwila, Washington .

This team is responsible for building and maintaining multiple secure development environments that enable continuous integration and secure software development for the US Air force E-7A Rapid Prototype Program. This position will focus on developing and maintaining RKE2 and EKS backed software development tools supporting Maven Gradle and Python development pipelines in both AWS and onsite disconnected environments.

Position Responsibilities:

* Lead the design, development, testing and verification of E-7A software engineering tools
* Develop and enhance our DevSecOps operations across multiple environments
* Collaborate with cross-functional teams to integrate and implement solutions for software development
* Provide end user support and troubleshooting to software development teams using program defined pipeline configurations and development environments
* Monitor and maintain software development environments and tools
* Participate in peer reviews and provide technical leadership to junior engineers
* Utilize analytical problem-solving skills to resolve issues and improve processes and tools

Our team is currently hiring for a broad range of experience levels including: Senior and Lead Engineers.

This position is expected to be 100% onsite. The selected candidate will be required to work onsite at one of the listed location options.

Security Clearance and Export Control Requirements:
This position requires an active Secret U.S. Security Clearance. (A U.S. Security Clearance that has been active in the past 24 months is considered active.)

Basic Qualifications (Required Skills/ Experience):

* 3+ years of experience administering Linux systems
* 3+ years of experience administering Kubernetes EKS or RKE2 deployments
* 3+ years of experience with AWS cloud development and containerization
* Programming or scripting experience in Java or C++ and Python

Preferred Qualifications (Desired Skills/Experience):

* Bachelor of Science degree from an accredited course of study in engineering, engineering technology (includes manufacturing engineering technology), chemistry, physics, mathematics, data science, or computer science
* Ability to set up, and manage a toolchain that uses GitLab, GitLab-CI, Gradle, Maven, Nexus/Artifactory, SonarQube, Prometheus, FluentBit, CloudWatch, Helm, Terraform, Ansible, OpenShift, Docker and more
* AWS Certification is preferred
* Containerization knowledge including Docker, Kubernetes, Helm, Rancher, Istio, Big Bang
* Understanding of designing and implementing full stack/Microservice infrastructure
* Understanding of secure software development methodologies and Security First mindset
* Understanding of cloud architecture and design methodologies
* Strong Working knowledge of the CI/CD process including debugging, test, and integration of software tools
* Knowledge of general software development and testing tools, including compilers, linkers, debuggers, and requirements management tools
* Experience with Confluence, Jira

Drug Free Workplace:
Boeing is a Drug Free Workplace (DFW) where post offer applicants and employees are subject to testing for marijuana, cocaine, opioids, amphetamines, PCP, and alcohol when criteria is met as outlined in our policies.

Union:
This is a union-represented position.

CodeVue Coding Challenge:
To be considered for this position you will be required to complete a technical assessment as part of the selection process. Failure to complete the assessment will remove you from consideration.

Pay & Benefits:
At Boeing, we strive to deliver a Total Rewards package that will attract, engage and retain the top talent. Elements of the Total Rewards package include competitive base pay and variable compensation opportunities.

The Boeing Company also provides eligible employees with an opportunity to enroll in a variety of benefit programs, generally including health insurance, flexible spending accounts, health savings accounts, retirement savings plans, life and disability insurance programs, and a number of programs that provide for both paid and unpaid time away from work.

The specific programs and options available to any given employee may vary depending on eligibility factors such as geographic location, date of hire, and the applicability of collective bargaining agreements.

Pay is based upon candidate experience and qualifications, as well as market and business considerations.

Summary pay range for Senior Level: $130,900 - $177,100.
Summary pay range for Lead Level: $161,500 - $218,500.

Applications for this position will be accepted until Mar. 30, 2026

Export Control Requirements:
This position must meet U.S. export control compliance requirements. To meet U.S. export control compliance requirements, a "U.S. Person" as defined by 22 C.F.R. §120.62 is required. "U.S. Person" includes U.S. Citizen, U.S. National, lawful permanent resident, refugee, or asylee.
Export Control Details:
US based job, US Person required
Relocation
This position offers relocation based on candidate eligibility.
Security Clearance
This position requires an active U.S. Secret Security Clearance (U.S. Citizenship Required). (A U.S. Security Clearance that has been active in the past 24 months is considered active)
Visa Sponsorship
Employer will not sponsor applicants for employment visa status.
Shift
This position is for 1st shift

Equal Opportunity Employer:
Boeing is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, age, physical or mental disability, genetic factors, military/veteran status or other characteristics protected by law.
Not Specified
Devops CI/CD Engineer - Flexible Hybrid Work Schedule (AUSTIN)
🏢 JABIL CIRCUIT, INC
Salary not disclosed
**This position supports hybrid work schedule depending on organization needs.**

RESPONSIBILITIES:

- Architect, design, and maintain scalable CI/CD pipelines using Azure/AWS DevSecOps.

- Build and optimize Docker-based microservices, images, and deployment pipelines.

- Lead deployments across Docker Swarm, Kubernetes/EKS, and multi-location environments.

- Develop infrastructure automation using Ansible, bash scripting, Terraform and Git-based workflow.

- Manage release pipelines using container registries, artifact feeds, template pipelines, and multi-stage workflows.

- Design multi-environment strategies for dev, QA, staging, and production deployment.

- Implement cloud-native services with AWS & Azure cloud platforms.

- Implement basic security practices, including IAM roles, secrets management, and access controls.

- Develop secure, modular, reusable build and release systems.

- Work closely with full-stack engineering teams (Angular, Java, Python , backend APIs, database engineers).

- Mentor junior DevOps engineers and lead DevOps roadmap decisions.

KNOWLEDGE REQUIREMENTS:

DevOps Expertise:

Azure DevOps pipelines, YAML templating, CI/CD strategy, Git branching models.

Containerization & Orchestration:

Docker images, Docker Compose, Docker Swarm, multi-node/multi-location deployments.

Cloud Technologies:

Azure deployments & infrastructure, AWS (IAM, Lambda, S3, CloudWatch).

Programming / Scripting Languages:

Python, Bash, Linux/Unix administration, awk, shell automation, groovy.

Infrastructure Automation:

Ansible playbooks, tasks/roles, inventory design, configuration management.

Distributed Deployment Architecture:

Multi-site replication, node selection by IP, dynamic service routing.

Database Stack Experience:

PostgreSQL, MySQL, MariaDB operations & migrations.

Observability & Logging:

CloudWatch monitoring, log collection, Prometheus, Grafana, reporting & metrics.

Version Control & Build Systems:

Azure Devops, Git, Git submodules, artifact storage, registry solutions, Secrets Management.

Nice to have AI knowledge/experience and willingness to learn.

EDUCATION & EXPERIENCE REQUIREMENTS

- BS degree in Electrical/Computer Engineering, Computer Science or related field. MS preferred.
- 7+ years experience in a software devops/development/test capacity with enterprise server, storage or networking products.
Remote working/work at home options are available for this role.
temporary
DevOps Engineering,
🏢 JABIL CIRCUIT, INC
Salary not disclosed
Kyle, TX 2 days ago
**This position supports hybrid work schedule depending on organization needs.**

RESPONSIBILITIES:
~ Architect, design, and maintain scalable CI/CD pipelines using Azure/AWS DevSecOps.

~ Build and optimize Docker-based microservices, images, and deployment pipelines.

~ Lead deployments across Docker Swarm, Kubernetes/EKS, and multi-location environments.

~ Develop infrastructure automation using Ansible, bash scripting, Terraform and Git-based workflow.

~ Manage release pipelines using container registries, artifact feeds, template pipelines, and multi-stage workflows.

~ Design multi-environment strategies for dev, QA, staging, and production deployment.

~ Implement cloud-native services with AWS & Azure cloud platforms.

~ Implement basic security practices, including IAM roles, secrets management, and access controls.

~ Develop secure, modular, reusable build and release systems.

~ Work closely with full-stack engineering teams (Angular, Java, Python , backend APIs, database engineers).

~ Mentor junior DevOps engineers and lead DevOps roadmap decisions.

KNOWLEDGE REQUIREMENTS:

DevOps Expertise :
Azure DevOps pipelines, YAML templating, CI/CD strategy, Git branching models.

Containerization & Orchestration :
Docker images, Docker Compose, Docker Swarm, multi-node/multi-location deployments.

Cloud Technologies :
Azure deployments & infrastructure, AWS (IAM, Lambda, S3, CloudWatch).

Programming / Scripting Languages :
Python, Bash, Linux/Unix administration, awk, shell automation, groovy.

Infrastructure Automation :
Ansible playbooks, tasks/roles, inventory design, configuration management.

Distributed Deployment Architecture :
Multi-site replication, node selection by IP, dynamic service routing.

Database Stack Experience :
PostgreSQL, MySQL, MariaDB operations & migrations.

Observability & Logging :
CloudWatch monitoring, log collection, Prometheus, Grafana, reporting & metrics.

Version Control & Build Systems :
Azure Devops, Git, Git submodules, artifact storage, registry solutions, Secrets Management.

Nice to have AI knowledge/experience and willingness to learn.

EDUCATION & EXPERIENCE REQUIREMENTS
~ BS degree in Electrical/Computer Engineering, Computer Science or related field. MS preferred.
~7+ years experience in a software devops/development/test capacity with enterprise server, storage or networking products.
permanent
Physician / New York / Permanent / DevOps / SRE Job
✦ New
Salary not disclosed
New York 1 day ago
An asset manager in New York City is actively seeking a self-motivated and hardworking professional to join their staff as their newDevOps / SRE.

Responsibilities The DevOps / SRE will: Build and maintain the infrastructure that supports the firm's trading systems Collaborate with development teams to design and implement automated build and deployment pipelines Drive the rapid adoption of new processes/systems Provide hands-on support to the trading team Qualifications BS/MS in Computer Science, Engineering, or related discipline 5+ years experience in the Platform, SRE, Production, or Systems Engineering fields Excellent knowledge of all aspects of the software engineering process, including Coding, Testing, Deployment, Scalability, Security, and Maintainability Ability to set-up andmanage CI/CD activities and tools (e.g.

Gitlab, Bitbucket), as well as build you own solutions (e.g.

Java/Gradle) Track record of working with distributed systems in a trading environment e.g.

Aeron, Kafka, and RabbitMQ Deep understanding of best practices, design patterns, and principles for highly decoupled and scalable systems Good knowledge of Unix systems / Bash / networks Experience with infrastructure and application observability tooling e.g.

Datadog, Prometheus, and Grafana Strong knowledge in coding/scripting (Java, Python, Go, or Bash) Experience with automation/configuration frameworks using Terraform, Kustomize, Ansible, Helm, or an equivalent Desired skills Experience with cloud platforms (ideally AWS) Experience in API Management (routing, gateways, versioning) with profound understanding of API Development aspects Ability to apply strategies for efficient communication, data consistency, and resilience across micro services, including experience with API design, message-based communication, and event-driven architectures Experience in defining and enforcing architectural patterns (SOA, CQRS, Event Sourcing etc.) Experience in performance/stress test and system tuning
permanent
Observability and AI Enterprise Architect
✦ New
🏢 ClifyX
Salary not disclosed
Edison, NJ 1 day ago

Key Responsibilities:

  • Design and deploy observability frameworks leveraging tools such as Grafana, Dynatrace, Prometheus, ELK, Splunk, etc. Define best practices for monitoring, alerting, and visualization across hybrid and multi-cloud environments.
  • Develop strategies for monitoring KPIs tied to business outcomes (e.g., sales performance, supply chain efficiency, customer experience).
  • Collaborate with business and IT teams to identify key metrics and integrate them into dashboards and alerting systems.
  • Implement AIOps solutions using industry-leading platforms like OpenAI, AWS Bedrock, Google Gemini, Anthropic, and similar technologies.
  • Develop predictive analytics and anomaly detection models to proactively identify and resolve operational issues.
  • Integrate observability tools with ITSM platforms and automation workflows. Enable automated root cause analysis and remediation using AI/ML models.
  • Provide observability strategies for infrastructure (servers, storage, cloud), applications (microservices, APIs), and networks (LAN/WAN, SD-WAN). Collaborate with DevOps, SRE, and IT operations teams to ensure end-to-end visibility and reliability.
  • Establish observability standards, KPIs, and SLAs for performance and availability. Ensure compliance with security and regulatory requirements in monitoring solutions.
  • Develop scalable architecture using LLMs, agentic frameworks, and multi-modal AI technologies.
  • Build AI-powered analytics platforms for IT operations analysis, anomaly detection, and predictive insights.
  • Architect and deploy intelligent chatbots for IT support and self-service capabilities.
  • Integrate AI solutions with existing IT operations tools and workflows.
  • Implement automated remediation and root cause analysis using AI/ML models.


Qualifications:

  • 10-13 years of relevant experience
  • Hands-on experience with Grafana, Dynatrace, and other monitoring platforms.
  • Practical experience implementing AI-based solutions for anomaly detection, predictive maintenance, and automated remediation. Familiarity with OpenAI, Bedrock, Gemini, Anthropic, or similar AI platforms.
  • Strong understanding of infrastructure, application architectures, and networking. Experience with cloud platforms (AWS, Azure, GCP) and container orchestration (Kubernetes).
  • Proficiency in Python, Bash, or similar scripting languages for automation and integration.
  • Strong experience with LLMs (OpenAI, Anthropic, Gemini, Bedrock) and agentic AI solutions.
  • Hands-on experience in designing AI architectures for enterprise IT environments.
  • Proficiency in Python or similar languages for AI model integration and automation.
Not Specified
Site Reliability Engineer II
Salary not disclosed
Alpharetta, GA 3 days ago
Title: Site Reliability Engineer II

Location: Alpharetta, GA (3 days a week onsite)

Duration: 6 months


Job Description:

We are seeking a skilled Site Reliability Engineer to join our team and help build, maintain, and scale our cloud-native infrastructure. You will work closely with development and operations teams to ensure our systems are reliable, scalable, and efficient. The ideal candidate is passionate about automation, observability, and infrastructure-as-code, and thrives in a collaborative, fast-paced environment.

Key Responsibilities



  • Design, implement, and manage cloud infrastructure on Azure using Terraform and Terragrunt.


  • Maintain and optimize Kubernetes clusters on Azure Kubernetes Service (AKS).


  • Build and manage CI/CD pipelines using GitHub Actions/Workflows and ArgoCD for GitOps deployments.


  • Enhance system reliability by implementing monitoring, alerting, and observability solutions with Grafana.


  • Automate operational tasks to reduce toil and improve team efficiency.


  • Participate in on-call rotations, incident response, and post-mortem analysis.


  • Collaborate with development teams to improve application performance, scalability, and resilience.


  • Implement and advocate for SRE best practices, including SLIs, SLOs, and error budgets.


  • Continuously improve system performance, cost efficiency, and security.



Required Skills & Qualifications



  • 3+ years of experience in an SRE, DevOps, or cloud infrastructure role.


  • Strong experience with Azure cloud services and infrastructure.


  • Hands-on experience with java and Terraform and Terragrunt for infrastructure-as-code.


  • Proficiency with Kubernetes (preferably AKS and container orchestration.


  • Experience with CI/CD tools, especially GitHub Workflows/Actions and ArgoCD.


  • Solid understanding of observability tools like Grafana (Prometheus, Loki, Tempo experience is a plus).

    Education Requirements Bachelor's degree required, (Masters preferred)

Not Specified
jobs by JobLookup
✓ All jobs loaded