Prometheus Ql Examples Jobs in Usa

661 positions found — Page 2

Devops CI/CD Engineer - Flexible Hybrid Work Schedule (AUSTIN)
Salary not disclosed
**This position supports hybrid work schedule depending on organization needs.**

RESPONSIBILITIES:

- Architect, design, and maintain scalable CI/CD pipelines using Azure/AWS DevSecOps.

- Build and optimize Docker-based microservices, images, and deployment pipelines.

- Lead deployments across Docker Swarm, Kubernetes/EKS, and multi-location environments.

- Develop infrastructure automation using Ansible, bash scripting, Terraform and Git-based workflow.

- Manage release pipelines using container registries, artifact feeds, template pipelines, and multi-stage workflows.

- Design multi-environment strategies for dev, QA, staging, and production deployment.

- Implement cloud-native services with AWS & Azure cloud platforms.

- Implement basic security practices, including IAM roles, secrets management, and access controls.

- Develop secure, modular, reusable build and release systems.

- Work closely with full-stack engineering teams (Angular, Java, Python , backend APIs, database engineers).

- Mentor junior DevOps engineers and lead DevOps roadmap decisions.

KNOWLEDGE REQUIREMENTS:

DevOps Expertise:

Azure DevOps pipelines, YAML templating, CI/CD strategy, Git branching models.

Containerization & Orchestration:

Docker images, Docker Compose, Docker Swarm, multi-node/multi-location deployments.

Cloud Technologies:

Azure deployments & infrastructure, AWS (IAM, Lambda, S3, CloudWatch).

Programming / Scripting Languages:

Python, Bash, Linux/Unix administration, awk, shell automation, groovy.

Infrastructure Automation:

Ansible playbooks, tasks/roles, inventory design, configuration management.

Distributed Deployment Architecture:

Multi-site replication, node selection by IP, dynamic service routing.

Database Stack Experience:

PostgreSQL, MySQL, MariaDB operations & migrations.

Observability & Logging:

CloudWatch monitoring, log collection, Prometheus, Grafana, reporting & metrics.

Version Control & Build Systems:

Azure Devops, Git, Git submodules, artifact storage, registry solutions, Secrets Management.

Nice to have AI knowledge/experience and willingness to learn.

EDUCATION & EXPERIENCE REQUIREMENTS

- BS degree in Electrical/Computer Engineering, Computer Science or related field. MS preferred.
- 7+ years experience in a software devops/development/test capacity with enterprise server, storage or networking products.
Remote working/work at home options are available for this role.
temporary
DevOps Engineering,
🏢 JABIL CIRCUIT, INC
Salary not disclosed
Kyle, TX 2 days ago
**This position supports hybrid work schedule depending on organization needs.**

RESPONSIBILITIES:
~ Architect, design, and maintain scalable CI/CD pipelines using Azure/AWS DevSecOps.

~ Build and optimize Docker-based microservices, images, and deployment pipelines.

~ Lead deployments across Docker Swarm, Kubernetes/EKS, and multi-location environments.

~ Develop infrastructure automation using Ansible, bash scripting, Terraform and Git-based workflow.

~ Manage release pipelines using container registries, artifact feeds, template pipelines, and multi-stage workflows.

~ Design multi-environment strategies for dev, QA, staging, and production deployment.

~ Implement cloud-native services with AWS & Azure cloud platforms.

~ Implement basic security practices, including IAM roles, secrets management, and access controls.

~ Develop secure, modular, reusable build and release systems.

~ Work closely with full-stack engineering teams (Angular, Java, Python , backend APIs, database engineers).

~ Mentor junior DevOps engineers and lead DevOps roadmap decisions.

KNOWLEDGE REQUIREMENTS:

DevOps Expertise :
Azure DevOps pipelines, YAML templating, CI/CD strategy, Git branching models.

Containerization & Orchestration :
Docker images, Docker Compose, Docker Swarm, multi-node/multi-location deployments.

Cloud Technologies :
Azure deployments & infrastructure, AWS (IAM, Lambda, S3, CloudWatch).

Programming / Scripting Languages :
Python, Bash, Linux/Unix administration, awk, shell automation, groovy.

Infrastructure Automation :
Ansible playbooks, tasks/roles, inventory design, configuration management.

Distributed Deployment Architecture :
Multi-site replication, node selection by IP, dynamic service routing.

Database Stack Experience :
PostgreSQL, MySQL, MariaDB operations & migrations.

Observability & Logging :
CloudWatch monitoring, log collection, Prometheus, Grafana, reporting & metrics.

Version Control & Build Systems :
Azure Devops, Git, Git submodules, artifact storage, registry solutions, Secrets Management.

Nice to have AI knowledge/experience and willingness to learn.

EDUCATION & EXPERIENCE REQUIREMENTS
~ BS degree in Electrical/Computer Engineering, Computer Science or related field. MS preferred.
~7+ years experience in a software devops/development/test capacity with enterprise server, storage or networking products.
permanent
Site Reliability Engineer II
Salary not disclosed
Alpharetta, GA 3 days ago
Title: Site Reliability Engineer II

Location: Alpharetta, GA (3 days a week onsite)

Duration: 6 months


Job Description:

We are seeking a skilled Site Reliability Engineer to join our team and help build, maintain, and scale our cloud-native infrastructure. You will work closely with development and operations teams to ensure our systems are reliable, scalable, and efficient. The ideal candidate is passionate about automation, observability, and infrastructure-as-code, and thrives in a collaborative, fast-paced environment.

Key Responsibilities



  • Design, implement, and manage cloud infrastructure on Azure using Terraform and Terragrunt.


  • Maintain and optimize Kubernetes clusters on Azure Kubernetes Service (AKS).


  • Build and manage CI/CD pipelines using GitHub Actions/Workflows and ArgoCD for GitOps deployments.


  • Enhance system reliability by implementing monitoring, alerting, and observability solutions with Grafana.


  • Automate operational tasks to reduce toil and improve team efficiency.


  • Participate in on-call rotations, incident response, and post-mortem analysis.


  • Collaborate with development teams to improve application performance, scalability, and resilience.


  • Implement and advocate for SRE best practices, including SLIs, SLOs, and error budgets.


  • Continuously improve system performance, cost efficiency, and security.



Required Skills & Qualifications



  • 3+ years of experience in an SRE, DevOps, or cloud infrastructure role.


  • Strong experience with Azure cloud services and infrastructure.


  • Hands-on experience with java and Terraform and Terragrunt for infrastructure-as-code.


  • Proficiency with Kubernetes (preferably AKS and container orchestration.


  • Experience with CI/CD tools, especially GitHub Workflows/Actions and ArgoCD.


  • Solid understanding of observability tools like Grafana (Prometheus, Loki, Tempo experience is a plus).

    Education Requirements Bachelor's degree required, (Masters preferred)

Not Specified
Staff Software Engineer, Observability
Salary not disclosed
San Francisco, CA 3 days ago

About Pinterest:


Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to bring everyone the inspiration to create a life they love, and that starts with the people behind the product.


Discover a career where you ignite innovation for millions, transform passion into growth opportunities, celebrate each other's unique experiences and embrace theflexibility to do your best work. Creating a career you love? It's Possible.


At Pinterest, AI isn't just a feature, it's a powerful partner that augments our creativity and amplifies our impact, and we're looking for candidates who are excited to be a part of that. To get a complete picture of your experience and abilities, we'll explore your foundational skills and how you collaborate with AI.


Through our interview process, what matters most is that you can always explain your approach, showing us not just what you know, but how you think. You can read more about our AI interview philosophy and how we use AI in our recruiting process here.

We're seeking an exceptional Staff Software Engineer to join our Observability team at Pinterest. This role combines deep technical expertise in distributed systems and data engineering with a product-oriented mindset to build world-class observability solutions that empower our engineering organization.As a Staff Engineer on the Observability team, you'll be responsible for designing and building the infrastructure and tools that provide visibility into Pinterest's large-scale distributed systems, helping thousands of engineers understand, debug, and optimize their services.


What you'll do:



  • Define and execute the observability roadmap, treating it as a product. Understand engineering team needs and translate them into technical solutions with measurable impact.
  • Architect, build, and scale distributed observability infrastructure (metrics, logs, traces) to handle massive volumes across Pinterest's distributed systems.
  • Build high-performance data pipelines and storage for real-time and historical telemetry analysis at Pinterest scale.
  • Champion Best Practices: Establish observability standards and patterns across the organization, making it easy for teams to instrument their services and gain actionable insights
  • Technical Leadership: Mentor engineers, lead architectural reviews, and influence technical decisions across teams to improve overall system reliability and performance
  • Cross-functional Collaboration: Partner with SRE, Infrastructure, Product Engineering, and other teams to understand pain points and deliver solutions that improve developer productivity and system reliability
  • Innovation: Stay current with observability trends and technologies, evaluating and adopting cutting-edge tools and techniques to keep Pinterest at the forefront

What we're looking for:



  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.
  • Product Mindset: Demonstrated ability to work backwards from customer needs -understanding user needs, prioritizing features, measuring success, and iterating based on feedback. Experience building internal platforms or tools with strong adoption
  • Distributed Systems Expertise: 7+ years of experience designing and operating large-scale distributed systems with deep understanding of consistency, availability, scalability, and failure modes
  • Data Engineering Skills: Strong background in building data pipelines, working with time-series databases, columnar storage, stream processing (Kafka, Flink, etc.), and data modeling at scale
  • Observability Domain Knowledge: Hands-on experience with modern observability tools and practices including metrics, logging, tracing, and profiling. Familiarity with OpenTelemetry, Prometheus, Grafana, or similar technologies
  • Programming Proficiency: Expert-level coding skills in languages like Java, Python, Go, or Scala with ability to write production-quality code
  • Systems Thinking: Ability to see the big picture while managing complex technical details, balancing trade-offs between cost, performance, and reliability
  • Experience building observability platforms from the ground up or significantly scaling existing solutions
  • Familiarity with cloud-native architectures and technologies (Kubernetes, service mesh, etc.)
  • Track record of driving adoption of internal platforms through excellent documentation, UX, and developer advocacy
  • Experience with machine learning or anomaly detection applied to observability use cases
  • Strong communication skills with ability to influence stakeholders at all levels
  • Contributions to open-source observability projects, a plus


In-Office Requirement Statement:



  • We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection.
  • This role will need to be in the office for in-person collaboration 1-2 times/quarter and therefore can be situated anywhere in the country.

Relocation Statement:



  • This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.


#LI-REMOTE


#LI-JT1

At Pinterest we believe the workplace should be equitable, inclusive, and inspiring for every employee. In an effort to provide greater transparency, we are sharing the base salary range for this position. The position is also eligible for equity. Final salary is based on a number of factors including location, travel, relevant prior experience, or particular skills and expertise.


Information regarding the culture at Pinterest and benefits available for this position can be found here.

US based applicants only$177,185—$364,795 USD

Our Commitment to Inclusion:


Pinterest is an equal opportunity employer and makes employment decisions on the basis of merit. We want to have the best qualified people in every job. All qualified applicants will receive consideration for employment without regard to race, color, ancestry, national origin, religion or religious creed, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, age, marital status, status as a protected veteran, physical or mental disability, medical condition, genetic information or characteristics (or those of a family member) or any other consideration made unlawful by applicable federal, state or local laws. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you require a medical or religious accommodation during the job application process, please completethis formfor support.

Not Specified
REO Resiliency Engineering and Quality Leader (Hybrid)
✦ New
Salary not disclosed

*At Securian Financial the internal position title is Infrastructure Dir."

Mission

"To lead the engineering discipline that ensures Securian's technology platforms and cloud services are built and operated with uncompromising resilience, performance, and quality. This role drives the design and automation of fault-tolerant, high-availability architectures across AWS, Azure, and GCP-ensuring the enterprise meets resiliency, scalability, and efficiency expectations at every layer of technology."

Positioning

The Director of Resilience Engineering and Quality Leader is both a strategic peer and technical counterpart to the Infrastructure & Reliability Engineering Leader.

This role provides bench depth and succession coverage for REO's most technically complex domains while driving innovation in reliability, resilience, and performance practices.

  • Strategic influence: Shapes cloud reliability, quality engineering, and resilience strategy across REO and Architecture domains.

  • Operational authority: Leads Sr. Managers and Managers who own the execution of quality, resilience, and performance engineering capabilities.

  • Enterprise collaboration: Works hand-in-hand with Technology, Solution, Business, Data, and Enterprise Architects to embed reliability and resilience as core architecture principles.

Scope of Accountability

Resilience Engineering & Cloud Reliability

  • Architect and validate fault-tolerant, regionally resilient architectures across AWS, Azure, and GCP.

  • Own resilience automation, chaos testing, and IaC-based recovery validation.

  • Lead cross-cloud reliability design reviews and failure-mode analyses for critical systems.

Quality Engineering & Continuous Testing

  • Define enterprise-wide quality engineering strategy integrated into CI/CD pipelines.

  • Drive automation-first testing (functional, non-functional, performance, resilience).

  • Embed observability-driven quality validation and contract testing across services.

Performance, Capacity & Efficiency Engineering

  • Oversee predictive capacity planning, scaling automation, and cost/efficiency optimization (FinOps/GreenOps).

  • Partner with Platform & Infrastructure teams to tune performance across application and platform layers.

  • Measure and report on performance SLIs/SLAs aligned to REO's Reliability Metrics framework.

Cross-Domain Architecture Collaboration

  • Partner with Enterprise Architects to codify resilience and reliability standards in technology blueprints.

  • Collaborate with Technology & Solution Architects to design service reliability into delivery architectures.

  • Engage Data Architects for data resilience, replication, and pipeline reliability.

  • Work with Business Architects to align technical reliability goals with critical business outcomes.

Leadership & Talent Development

  • Lead a team of Sr. Managers and Managers, fostering a high-performance, hands-on engineering culture.

  • Build and mentor top-tier technical talent in cloud reliability, resilience, and quality automation.

  • Partner with HR and REO Enablement to develop succession plans and technical competency frameworks.

Core Technical Competencies

  • AWS (primary) - Multi-account design, HA architecture, region failover, resilience automation, Terraform/CDK/CloudFormation.

  • Azure & GCP (secondary) - Compute, networking, and reliability constructs; hybrid cloud design and failover integration.

  • Infrastructure as Code (IaC) - Deep proficiency in Terraform, policy-as-code (OPA/Conftest), drift detection, pipeline integration.

  • Reliability & Chaos Engineering - AWS Fault Injection Simulator, Gremlin, steady-state hypothesis design.

  • Observability & Quality Automation - OpenTelemetry, Prometheus, CloudWatch, K6, Gatling; CI/CD quality gates and dashboards.

  • Performance Engineering - Load, stress, and soak testing automation; performance profiling and SLO alignment.

  • Disaster Recovery Automation - Cross-region orchestration, IaC-driven DR runs, replication validation.

  • FinOps/GreenOps - Cloud cost and efficiency automation, carbon-aware scaling policies.

Leadership Competencies

  • Strategic Technical Leadership: Operates at the intersection of deep engineering and executive strategy.

  • Multi-Domain Collaborator: Integrates reliability and resilience across architecture, operations, and business domains.

  • Talent Multiplier: Develops and empowers senior managers, fostering engineering mastery and innovation.

  • Credible Technical Authority: Trusted peer to Infrastructure & Reliability Engineering; capable of leading architecture reviews and executive briefings.

  • Change Champion: Drives transformation of reliability practices across platforms, pipelines, and teams.

Qualifications & Experience

  • 12+ years in cloud engineering, reliability, or platform leadership roles.

  • 5+ years leading Sr. Managers/Managers in technical domains.

  • Proven expertise across AWS, with working knowledge of Azure and GCP.

  • Experience with multi-cloud governance, DR design, IaC at scale, and reliability automation.

  • Strong understanding of observability, SRE principles, and REO/ITIL-aligned reliability frameworks.

  • Certifications:

    • Required: AWS Certified Solutions Architect - Professional

    • Preferred: AWS DevOps Engineer, Azure Solutions Architect Expert, Google Professional Cloud Architect

Success Metrics

  • 99.9% availability maintained for Tier-1 workloads.

  • 100% coverage of DR automation for Tier-1 services.

  • 25% annual increase in automated quality/test coverage.

  • 15% annual improvement in resource efficiency and cost performance.

  • Documented resilience participation across all enterprise architecture blueprints.

  • Positive "technical peer readiness" and succession rating from Head of REO.

Summary Value Proposition

This Director role blends deep AWS reliability engineering expertise, multi-cloud technical breadth, and leadership scale.

It ensures REO maintains both technical depth and leadership redundancy, and it strengthens the bridge between engineering execution and enterprise architecture alignment.

#LI-hybrid **This position will be in a hybrid working arrangement.**


Securian Financial believes in hybrid work as an integral part of our culture. Associates get the benefit of working both virtually and in our offices. If you're in a commutable distance (90 minutes), you'll join us 3 days each week in our offices to collaborate and build relationships. Our policy allows flexibility for the reality of business and personal schedules.

The estimated base pay range for this job is:

$145,000.00 - $267,000.00

Pay may vary depending on job-related factors and individual experience, skills, knowledge, etc. More information on base pay and incentive pay (if applicable) can be discussed with a member of the Securian Financial Talent Acquisition team.

Be you. With us. At Securian Financial, we understand that attracting top talent means offering more than just a job - it means providing a rewarding and fulfilling career. As a valued member of our high-performing team, we want you to connect with your work, your relationships and your community. Enjoy our comprehensive range of benefits designed to enhance your professional growth, well-being and work-life balance, including the advantages listed here:

Paid time off:

  • We want you to take time off for what matters most to you. Our PTO program provides flexibility for associates to take meaningful time away from work to relax, recharge and spend time doing what's important to them. And Securian Financial rewards associates for their service by providing additional PTO the longer you stay at Securian.

  • Leave programs: Securian's flexible leave programs allow time off from work for parental leave, caregiver leave for family members, bereavement and military leave.

  • Holidays: Securian provides nine company paid holidays.

Company-funded pension plan and a 401(k) retirement plan: Share in the success of our company. Securian's 401(k) company contribution is tied to our performance up to 10 percent of eligible earnings, with a target of 5 percent. The amount is based on company results compared to goals related to earnings, sales and service.

Health insurance: From the first day of employment, associates and their eligible family members - including spouses, domestic partners and children - are eligible for medical, dental and vision coverage.

Volunteer time: We know the importance of community. Through company-sponsored events, volunteer paid time off, a dollar-for-dollar matching gift program and more, we encourage you to support organizations important to you.

Associate Resource Groups: Build connections, be yourself and develop meaningful relationships at work through associate-led ARGs. Dedicated groups focus on a variety of interests and affinities, including:

  • Mental Wellness and Disability

  • Pride at Securian Financial

  • Securian Young Professionals Network

  • Securian Multicultural Network

  • Securian Women and Allies Network

  • Servicemember Associate Resource Group

For more information regarding Securian's benefits, please review our Benefits page.

This information is not intended to explain all the provisions of coverage available under these plans. In all cases, the plan document dictates coverage and provisions.

Securian Financial Group, Inc. does not discriminate based on race, color, religion, national origin, sex, gender, gender identity, sexual orientation, age, marital or familial status, pregnancy, disability, genetic information, political affiliation, veteran status, status in regard to public assistance or any other protected status. If you are a job seeker with a disability and require an accommodation to apply for one of our jobs, please contact us by email at , by telephone (voice), or 711 (Relay/TTY).

To view our privacy statement click here

To view our legal statement click here


Remote working/work at home options are available for this role.
Not Specified
Software engineer–devsecops (senior or lead)
🏢 Boeing
Salary not disclosed
Tukwila, Washington 4 days ago

Job DescriptionAt Boeing, we innovate and collaborate to make the world a better place. We're committed to fostering an environment for every teammate that's welcoming, respectful and inclusive, with great opportunity for professional growth. Find your future with us.The Boeing Company is currently seeking a Software Engineer–Dev Sec Ops (Senior or Lead) to support our E7 A Software Development Environments team located in Tukwila, Washington .This team is responsible for building and maintaining multiple secure development environments that enable continuous integration and secure software development for the US Air force E-7 A Rapid Prototype Program. This position will focus on developing and maintaining RKE2 and EKS backed software development tools supporting Maven Gradle and Python development pipelines in both AWS and onsite disconnected environments.Position Responsibilities:Lead the design, development, testing and verification of E-7 A software engineering toolsDevelop and enhance our Dev Sec Ops operations across multiple environmentsCollaborate with cross-functional teams to integrate and implement solutions for software developmentProvide end user support and troubleshooting to software development teams using program defined pipeline configurations and development environmentsMonitor and maintain software development environments and toolsParticipate in peer reviews and provide technical leadership to junior engineers

Utilize analytical problem-solving skills to resolve issues and improve processes and toolsOur team is currently hiring for a broad range of experience levels including: Senior and Lead Engineers.This position is expected to be 100% onsite. The selected candidate will be required to work onsite at one of the listed location options.Security Clearance and Export Control Requirements:This position requires an active Secret U. S. Security Clearance. (A U. S. Security Clearance that has been active in the past 24 months is considered active.)Basic Qualifications (Required Skills/ Experience):3+ years of experience administering Linux systems3+ years of experience administering Kubernetes EKS or RKE2 deployments3+ years of experience with AWS cloud development and containerization

Programming or scripting experience in Java or C++ and PythonPreferred Qualifications (Desired Skills/Experience):Bachelor of Science degree from an accredited course of study in engineering, engineering technology (includes manufacturing engineering technology), chemistry, physics, mathematics, data science, or computer scienceAbility to set up, and manage a toolchain that uses Git Lab, Git Lab-CI, Gradle, Maven, Nexus/Artifactory, Sonar Qube, Prometheus, Fluent Bit, Cloud Watch, Helm, Terraform, Ansible, Open Shift, Docker and moreAWS Certification is preferredContainerization knowledge including Docker, Kubernetes, Helm, Rancher, Istio, Big BangUnderstanding of designing and implementing full stack/Microservice infrastructureUnderstanding of secure software development methodologies and Security First mindsetUnderstanding of cloud architecture and design methodologiesStrong Working knowledge of the CI/CD process including debugging, test, and integration of software toolsKnowledge of general software development and testing tools, including compilers, linkers, debuggers, and requirements management tools

Experience with Confluence, JiraDrug Free Workplace:Boeing is a Drug Free Workplace (DFW) where post offer applicants and employees are subject to testing for marijuana, cocaine, opioids, amphetamines, PCP, and alcohol when criteria is met as outlined in our policies.Union:This is a union-represented position.Code Vue Coding Challenge:To be considered for this position you will be required to complete a technical assessment as part of the selection process. Failure to complete the assessment will remove you from consideration.Pay & Benefits:At Boeing, we strive to deliver a Total Rewards package that will attract, engage and retain the top talent. Elements of the Total Rewards package include competitive base pay and variable compensation opportunities.The Boeing Company also provides eligible employees with an opportunity to enroll in a variety of benefit programs, generally including health insurance, flexible spending accounts, health savings accounts, retirement savings plans, life and disability insurance programs, and a number of programs that provide for both paid and unpaid time away from work.The specific programs and options available to any given employee may vary depending on eligibility factors such as geographic location, date of hire, and the applicability of collective bargaining agreements.Pay is based upon candidate experience and qualifications, as well as market and business considerations.Summary pay range for Senior Level: $130,900 - $177,100.Summary pay range for Lead Level: $161,500 - $218,500.Applications for this position will be accepted until Mar. 30, 2026Export Control Requirements:This position must meet U. S. export control compliance requirements. To meet U. S. export control compliance requirements, a "U. S. Person" as defined by 22 C. F. R. §120.62 is required. "U. S. Person" includes U. S. Citizen, U. S. National, lawful permanent resident, refugee, or asylee.Export Control Details:US based job, US Person requiredRelocationThis position offers relocation based on candidate eligibility.Security ClearanceThis position requires an active U. S. Secret Security Clearance (U. S. Citizenship Required). (A U. S. Security Clearance that has been active in the past 24 months is considered active)Visa SponsorshipEmployer will not sponsor applicants for employment visa status.ShiftThis position is for 1st shiftEqual Opportunity Employer:Boeing is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, age, physical or mental disability, genetic factors, military/veteran status or other characteristics protected by law.

Not Specified
Senior Software Deployment & Customer Operations Engineer
Salary not disclosed
Boston, MA 6 days ago

Senior Software Engineer – Deployment & Reliability (Digital Pathology / Medical Imaging)

A fast-growing technology company operating in the digital pathology and medical imaging space is seeking a Senior Software Engineer to support the deployment, configuration, and long-term reliability of advanced imaging and AI-driven software systems.


This role sits at the intersection of software deployment, infrastructure engineering, and site reliability, ensuring complex software platforms are successfully installed, integrated with customer IT environments, and maintained at high levels of performance and stability.


You will work closely with engineering, customer support, and monitoring teams to ensure a smooth transition from system deployment to ongoing operational support while contributing to improvements that make deployments more scalable and reliable over time.


Key Responsibilities

Deployment & Configuration

  • Lead end-to-end deployments of imaging, AI, and data management software systems at customer environments
  • Configure and integrate servers, clusters, and storage systems within hospital or laboratory IT infrastructures
  • Work with networking, authentication, storage, and security configurations to ensure successful installations
  • Collaborate with field engineering teams during system installation and commissioning
  • Develop standardized deployment playbooks, documentation, and validation checklists

System Reliability & Upgrades

  • Manage software version rollouts, upgrades, and patching across deployed customer environments
  • Work with monitoring and observability teams to track system performance and health
  • Troubleshoot complex issues across multi-component systems including imaging software, AI inference pipelines, and storage layers
  • Improve automation around upgrades, rollbacks, and maintenance processes

Engineering Collaboration & Continuous Improvement

  • Identify recurring deployment or performance challenges and work with R&D teams to design long-term solutions
  • Provide structured feedback from field deployments to improve product architecture and deployment workflows
  • Validate new deployment tools, frameworks, and configuration approaches prior to wider rollout
  • Contribute to improving the scalability and resilience of the overall platform

Customer IT & Cross-Functional Collaboration

  • Serve as a technical liaison with customer IT teams regarding networking, infrastructure, security, and data access
  • Ensure deployments comply with institutional IT policies and healthcare regulatory requirements
  • Collaborate closely with support and monitoring teams to align escalation processes and root cause investigations
  • Participate in post-deployment reviews to improve operational processes and reliability

Documentation & Knowledge Sharing

  • Maintain detailed installation and configuration documentation
  • Develop deployment guides, troubleshooting documentation, and internal knowledge resources
  • Support and mentor field teams on standardized deployment and configuration practices


Requirements

  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or related discipline
  • 5+ years of experience in software deployment, DevOps, infrastructure engineering, or systems engineering
  • Strong Linux (Ubuntu) administration and scripting skills
  • Experience with containerization and orchestration technologies (Docker, Kubernetes)
  • Experience with database technologies such as PostgreSQL or MongoDB
  • Familiarity with web service configuration (Nginx or Apache)
  • Solid understanding of networking concepts including VPNs, firewalls, and authentication systems
  • Ability to troubleshoot complex distributed systems across software, infrastructure, and data layers
  • Strong communication and collaboration skills when working with cross-functional teams and customer IT stakeholders


Preferred Experience

  • Exposure to medical imaging systems, digital pathology, or healthcare technology environments
  • Familiarity with DICOM or PACS systems
  • Experience deploying or supporting AI/ML models in production environments
  • Experience with observability and monitoring tools (Prometheus, Grafana, ELK)
  • Knowledge of regulated environments and healthcare compliance frameworks (HIPAA, GDPR, IVDR)
  • Experience supporting hardware and software integrated systems


Why This Role

This position offers the opportunity to work on advanced digital pathology and imaging technologies that support clinical diagnostics and research globally. The role combines hands-on technical deployment with the chance to influence how complex systems are designed, automated, and scaled across a growing global customer base.

Not Specified
Observability and AI Enterprise Architect
✦ New
🏢 ClifyX
Salary not disclosed
Edison, NJ 1 day ago

Key Responsibilities:

  • Design and deploy observability frameworks leveraging tools such as Grafana, Dynatrace, Prometheus, ELK, Splunk, etc. Define best practices for monitoring, alerting, and visualization across hybrid and multi-cloud environments.
  • Develop strategies for monitoring KPIs tied to business outcomes (e.g., sales performance, supply chain efficiency, customer experience).
  • Collaborate with business and IT teams to identify key metrics and integrate them into dashboards and alerting systems.
  • Implement AIOps solutions using industry-leading platforms like OpenAI, AWS Bedrock, Google Gemini, Anthropic, and similar technologies.
  • Develop predictive analytics and anomaly detection models to proactively identify and resolve operational issues.
  • Integrate observability tools with ITSM platforms and automation workflows. Enable automated root cause analysis and remediation using AI/ML models.
  • Provide observability strategies for infrastructure (servers, storage, cloud), applications (microservices, APIs), and networks (LAN/WAN, SD-WAN). Collaborate with DevOps, SRE, and IT operations teams to ensure end-to-end visibility and reliability.
  • Establish observability standards, KPIs, and SLAs for performance and availability. Ensure compliance with security and regulatory requirements in monitoring solutions.
  • Develop scalable architecture using LLMs, agentic frameworks, and multi-modal AI technologies.
  • Build AI-powered analytics platforms for IT operations analysis, anomaly detection, and predictive insights.
  • Architect and deploy intelligent chatbots for IT support and self-service capabilities.
  • Integrate AI solutions with existing IT operations tools and workflows.
  • Implement automated remediation and root cause analysis using AI/ML models.


Qualifications:

  • 10-13 years of relevant experience
  • Hands-on experience with Grafana, Dynatrace, and other monitoring platforms.
  • Practical experience implementing AI-based solutions for anomaly detection, predictive maintenance, and automated remediation. Familiarity with OpenAI, Bedrock, Gemini, Anthropic, or similar AI platforms.
  • Strong understanding of infrastructure, application architectures, and networking. Experience with cloud platforms (AWS, Azure, GCP) and container orchestration (Kubernetes).
  • Proficiency in Python, Bash, or similar scripting languages for automation and integration.
  • Strong experience with LLMs (OpenAI, Anthropic, Gemini, Bedrock) and agentic AI solutions.
  • Hands-on experience in designing AI architectures for enterprise IT environments.
  • Proficiency in Python or similar languages for AI model integration and automation.
Not Specified
RN/LPN Full Time Night Shift + Daily Pay
✦ New
Salary not disclosed
Grafton, Ohio 1 day ago

FT NightsPT DaysShifts:We offer a great FULL TIME benefits and perks package!Short Term Disability (Guardian)-for employee only, benefit percentage 60% of salary!Long Term Disability (Guardian)-for employee only, benefit percentage 60% of salary!Health Advocate (Employee Assistance Program)-for Employee, Spouse, Dependents, Parents, and Parents in Law.Examples that are available for help: Medical (BCBS)-for Employee, Spouse, and/or Dependents.HSA (Health Savings Account) is optional if Medical is selected. Great tax benefit!Dental (Guardian)-for Employee, Spouse, and/or Dependents.Hospital Indemnity (Guardian)-for Employee, Spouse, and/or Dependents.Metlife Legal (Legal Shield)-for Employee, Spouse, and/or Dependents.Assistance with Adoption, Lawyers, Wills and Trusts and much more!No waiting periods, no claim forms, no deductibles!Wide range of coverages for your fur babies!All dog and cat breeds are covered.

~ Tuition Reimbursement

Worked Holidays Paid @ Double Time!On Demand Pay Option (Examples: ZayZoon, Daily Pay)Bonuses:Employee Referral Bonus OpportunitiesShift Pick Up BonusesTraining Bonuses

We offer a great PART TIME perks package too!Worked Holidays Paid @ Double Time!On Demand Pay Option (Examples: ZayZoon, Daily Pay)Bonuses:Employee Referral Bonus OpportunitiesShift Pick Up BonusesTraining Bonuses

You walk in the door to a work family who wants to make the day count. We truly believe our employees and residents are a family that comes together to enjoy the good things in life, including one another. When our employees feel special, so do our residents. We are currently seeking applicants for Licensed Practical Nurse (LPN) positions. This position is also often referred to as Practical Nurse or PN. What do you do as an LPN at Danbury?~ Our Licensed Practical Nurses provide direct nursing care to residents, prepare and administer medications, perform routine charting and documentation duties, and perform other duties necessary to ensure that our residents' total regimens of care are maintained.

What experience or skills do you need to be a Licensed Practical Nurse at Danbury?Experience in a nursing capacity in a senior living setting is helpful, but not required.We are seeking Licensed Practical Nurses (LPNs) who are outstanding in their profession and would work well with our team.If you're a Licensed Practical Nurse (LPN) and want to make our residents' days better then apply now for immediate consideration!Danbury Senior Living provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.

permanent
Licensed Practical Nurse (LPN) Grafton
✦ New
🏢 Danbury North Ridgeville
Salary not disclosed
Grafton, Ohio 1 day ago

Danbury does not require employees to be vaccinated. Pay rate for this position is $27.00 up to $30.00

Openings:

  • FT Nights
  • PT Days

Shifts:

  • 6a-6p
  • 6p-6a

We offer a great FULL TIME benefits and perks package!

  • Company Paid Benefits:
    • Short Term Disability (Guardian)-for employee only, benefit percentage 60% of salary!
    • Long Term Disability (Guardian)-for employee only, benefit percentage 60% of salary!
    • Life and AD&D (Guardian)
    • Health Advocate (Employee Assistance Program)-for Employee, Spouse, Dependents, Parents, and Parents in Law.
      • Examples that are available for help: Emotional Support-Stress, Realtionships, Addictions, Mental Illness, Anger, Loss, Depression, Time Management.
      • Work and Life Balance Specialists
  • Employee Optional Benefits:
    • Medical (BCBS)-for Employee, Spouse, and/or Dependents.
      • HSA (Health Savings Account) is optional if Medical is selected. Great tax benefit!
    • Dental (Guardian)-for Employee, Spouse, and/or Dependents.
    • Vision (Guardian VSP)-for Employee, Spouse, and/or Dependents.
    • Additional Voluntary Life (Guardian)-for Employee, Spouse, and/or Dependents.
    • Additional Voluntary AD&D (Guardian)
    • Critical Illness (Guardian)-for Employee, Spouse, and/or Dependents.
    • Hospital Indemnity (Guardian)-for Employee, Spouse, and/or Dependents.
    • Accident (Guardian)
    • Metlife Legal (Legal Shield)-for Employee, Spouse, and/or Dependents.
      • Assistance with Adoption, Lawyers, Wills and Trusts and much more!
      • No waiting periods, no claim forms, no deductibles!
    • Metlife Pet Insurance
      • Wide range of coverages for your fur babies!
        • All dog and cat breeds are covered.
    • Identity Theft (All State)
    • 401(k) with Matching (TransAmerica)
    • Tuition Reimbursement
  • Perks:
    • Vacation from 90th Day of Employment
    • Worked Holidays Paid @ Double Time!
    • On Demand Pay Option (Examples: ZayZoon, Daily Pay)
  • Bonuses:
    • Employee Referral Bonus Opportunities
    • Shift Pick Up Bonuses
    • Training Bonuses

We offer a great PART TIME perks package too!

  • Perks:
    • Worked Holidays Paid @ Double Time!
    • On Demand Pay Option (Examples: ZayZoon, Daily Pay)
    • Opportunity for Advancement within the Company!
  • Benefits:
    • 401(k) with Matching (TransAmerica)
  • Bonuses:
    • Employee Referral Bonus Opportunities
    • Shift Pick Up Bonuses
    • Training Bonuses

At Danbury, you don't just clock in at a job. You walk in the door to a work family who wants to make the day count. We truly believe our employees and residents are a family that comes together to enjoy the good things in life, including one another. When our employees feel special, so do our residents. That's the Danbury Difference.

We are currently seeking applicants for Licensed Practical Nurse (LPN) positions. This position is also often referred to as Practical Nurse or PN. Our current available opportunities are:

What do you do as an LPN at Danbury?

  • Our Licensed Practical Nurses provide direct nursing care to residents, prepare and administer medications, perform routine charting and documentation duties, and perform other duties necessary to ensure that our residents' total regimens of care are maintained.

What experience or skills do you need to be a Licensed Practical Nurse at Danbury?

  • We are looking for applicants who are licensed by the State of Ohio.
  • Experience in a nursing capacity in a senior living setting is helpful, but not required.
  • We are seeking Licensed Practical Nurses (LPNs) who are outstanding in their profession and would work well with our team.

If you're a Licensed Practical Nurse (LPN) and want to make our residents' days better then apply now for immediate consideration!

Danbury Senior Living provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.

IND123

Not Specified
jobs by JobLookup
✓ All jobs loaded