Alibaba Cloud Linux, OS Jobs in Usa
2,097 positions found — Page 10
Hybrid (DFW Metroplex Preferred) | Texas or nearby states considered with travel expectations
3–6 Month Contract-to-HireOverview
We are seeking a highly skilled FinOps Implementation Consultant to serve as the founding hire for a growing FinOps practice. This role is both technical and consultative, focused on helping enterprise clients bridge the gap between IT and Finance to better manage, optimize, and govern cloud services spend.
This individual will lead end-to-end implementations of IBM Cloudability, act as a trusted advisor to executive stakeholders, and support functional presales efforts. The role is designed to evolve into a FinOps Practice Lead position, with responsibility for building, mentoring, and managing an internal FinOps team over time.
The ideal candidate is a strong communicator who can operate comfortably at both the technical and executive levels, translating complex cloud cost data into actionable business insights.
This role is perfect for a senior FinOps or cloud cost optimization professional ready to step into a foundational leadership position. The long-term vision is for this individual to grow into a FinOps Practice Lead, building and managing a high-performing internal team while continuing to serve as a trusted advisor to enterprise clients.Key Responsibilities
- Lead the end-to-end implementation of IBM Cloudability for enterprise clients, including technical integration, configuration, and optimization.
- Serve as a consultative partner to client IT, engineering, and finance teams, helping align cloud usage with financial governance and business objectives.
- Design FinOps strategies, tagging and allocation models, and reporting structures to enable accurate chargeback, showback, and cost visibility.
- Present findings, insights, and recommendations to senior leadership and C-level executives.
- Identify cost optimization opportunities and help clients establish sustainable FinOps operating models.
- Provide hands-on training, documentation, and knowledge transfer to client teams.
- Support functional presales activities, including solution positioning, scoping, and client presentations.
- Stay current with FinOps best practices, Cloudability product updates, and cloud cost optimization trends.
- Contribute to the long-term vision of the FinOps practice, including process development and team growth.
Required Qualifications
- 3+ years of experience in FinOps, cloud cost management, or cloud financial optimization.
- Hands-on experience implementing and configuring IBM Cloudability from a technical standpoint.
- Strong understanding of public cloud platforms (AWS, Azure, GCP) and their billing and cost models.
- Experience with Kubernetes cost visibility tools such as Kubecost is a plus.
- Proficiency with Linux, YAML, Helm, and CLI-based deployments.
- Familiarity with IT financial management, budgeting, and forecasting processes.
- Exceptional communication and presentation skills with the ability to engage both technical teams and executive leadership.
- Strong analytical and problem-solving skills, with the ability to translate data into business value.
- Consultative mindset with experience advising enterprise clients.
- Comfortable operating independently as a founding role and shaping a new practice.
- Interest in mentoring, leading, and growing a team over time.
- DFW metroplex candidates preferred with a hybrid schedule (3 days in office, 2 days remote).
- Candidates based in Texas or nearby states considered, with the ability to travel to the office one week per month. Client travel estimated at approximately 25–30%, including ad hoc client meetings.
- Initial 3–6 month contract with intent to convert to full-time employment.
- Upon conversion, benefits include: Medical, dental, and vision insurance 401(k) plan (no employer match at this time) Two (2) weeks of PTO Seven (7) paid holidays
Job Description
At Boeing, we innovate and collaborate to make the world a better place. We’re committed to fostering an environment for every teammate that’s welcoming, respectful and inclusive, with great opportunity for professional growth. Find your future with us.
The Boeing Company is currently seeking a Lead Software Engineer – DevSecOps to support our Phantom Works Virtual Warfare Center team located in Berkeley, MO. This position will focus on supporting the Boeing Defense, Space & Security (BDS) business organization.
The DevSecOps Lead Engineer will architect and implement secure development and execution environments for the rapid prototyping and experimentation we use to answer our customers’ toughest questions about future technologies and capabilities. The Virtual Warfare Center executes far-reaching analysis to address military capability gaps in and across multiple warfighting domains in the face of accelerating adversary capabilities. In DevSecOps you will be part of a team modernizing our approach to software development and enhancing our security posture.
As the Virtual Warfare Center’s DevSecOps Team Lead you will lead a team of engineers designing, implementing, and monitoring software development infrastructure across multiple networks and physical locations across the United States. You will build and maintain cross-functional relationships with multiple teams to coordinate the selection, approval, deployment, and maintenance of a consistent set of software tools in all locations. Your work will guarantee our development and deployment infrastructure and processes are reliable, efficient, consistent, and secure. Your team will partner with relevant stakeholders to create processes, design cloud-based solutions, support deploying applications in cloud environments, evaluate solution performance and implement enhancements. You will guide the team through the update of a legacy software development infrastructure to use modern technologies including containers, cloud, high performance computing, AI/ML, and automation. This position requires mentoring early-career employees on DevSecOps design, implementation, maintenance, communication, and leadership skills. Your team will track required software updates and drive the process to eliminate known vulnerabilities including monitoring systems, tools, and software packages for security vulnerabilities. You will contribute to a collaborative, cross-functional team managing software security approvals and automate the integration of security into all phases of the software development lifecycle. Your work with an array of software development, IT, and cybersecurity teams will address emergent issues while improving the efficiency and usability of our systems and software products.
Position Responsibilities:
- Lead a team of engineers responsible for designing, installing, configuring, and maintaining a consistent, secure software development toolchain across multiple networks and physical locations.
- Spearhead the approval and implementation of continuous integration and continuous deployment pipelines into collateral secret and program spaces.
- Coordinate between software development, IT, and security teams on vulnerability tracking and mitigation, driving efforts forward.
- Architect and implement the transition of a multi-site, multi-network software development environment into a cloud-based approach.
- Lead trade studies and tool selection to upgrade and modernize software development processes and operational infrastructure.
- Lead implementation of best practices and methodologies for provisioning, platform scaling, configuration management, monitoring and troubleshooting
- Maintain the DevSecOps vision and roadmap, track status, and communicate progress to stakeholders.
- Mentor and coach the team, provide technical leadership, foster a culture of knowledge sharing and continuous learning, and grow their skills.
Basic Qualifications (Required Skills/ Experience):
- Bachelor’s Degree in an engineering discipline or 17+ years equivalent related experience
- 10+ years’ experience with software engineering
- 3+ years’ experience with scripting languages such as Bash or Python
- 3+ years’ experience containerized software development
- 3+ years’ experience supporting DevSecOps lifecycle
- Experience with Agile development practices using continuous integration and deployment
- 3+ years of experience performing automation, implementation and deployments in both Windows and Linux systems
- Active Secret clearance
Preferred Qualifications (Desired Skills/Experience):
- Active Top Secret SCI clearance
- Experience with gitlab
- Experience with Jenkins
- Experience with JIRA
- 3+ years’ experience supporting cloud development environments
- Experience with cloud computing in classified environments
- CompTIA Security+
- Bachelor of Science degree from an accredited course of study in engineering, engineering technology (includes manufacturing engineering technology), chemistry, physics, mathematics, data science, or computer science.
Travel: 10%
Drug Free Workplace:
Boeing is a Drug Free Workplace (DFW) where post offer applicants and employees are subject to testing for marijuana, cocaine, opioids, amphetamines, PCP, and alcohol when criteria is met as outlined in our policies.
CodeVue Coding Challenge:
To be considered for this position you will be required to complete a technical assessment as part of the selection process. Failure to complete the assessment will remove you from consideration.
Pay & Benefits:
At Boeing, we strive to deliver a Total Rewards package that will attract, engage and retain the top talent. Elements of the Total Rewards package include competitive base pay and variable compensation opportunities.
The Boeing Company also provides eligible employees with an opportunity to enroll in a variety of benefit programs, generally including health insurance, flexible spending accounts, health savings accounts, retirement savings plans, life and disability insurance programs, and a number of programs that provide for both paid and unpaid time away from work.
The specific programs and options available to any given employee may vary depending on eligibility factors such as geographic location, date of hire, and the applicability of collective bargaining agreements.
Pay is based upon candidate experience and qualifications, as well as market and business considerations.
Summary Pay Range for Lead: $136,850 - $185,150
Applications for this position will be accepted until Mar. 25, 2026
Export Control Requirements:
This position must meet U.S. export control compliance requirements. To meet U.S. export control compliance requirements, a “U.S. Person” as defined by 22 C.F.R. §120.62 is required. “U.S. Person” includes U.S. Citizen, U.S. National, lawful permanent resident, refugee, or asylee.
Export Control Details:
US based job, US Person required
Relocation
This position offers relocation based on candidate eligibility.
Security Clearance
This position requires an active U.S. Secret Security Clearance (U.S. Citizenship Required). (A U.S. Security Clearance that has been active in the past 24 months is considered active)
Visa Sponsorship
Employer will not sponsor applicants for employment visa status.
Shift
This position is for 1st shift
Equal Opportunity Employer:
Boeing is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, age, physical or mental disability, genetic factors, military/veteran status or other characteristics protected by law.
Job Description
At Boeing, we innovate and collaborate to make the world a better place. We’re committed to fostering an environment for every teammate that’s welcoming, respectful and inclusive, with great opportunity for professional growth. Find your future with us.
The Boeing Company is currently seeking a Lead Software Engineer – DevSecOps to support our Phantom Works Virtual Warfare Center team located in Berkeley, MO. This position will focus on supporting the Boeing Defense, Space & Security (BDS) business organization.
The DevSecOps Lead Engineer will architect and implement secure development and execution environments for the rapid prototyping and experimentation we use to answer our customers’ toughest questions about future technologies and capabilities. The Virtual Warfare Center executes far-reaching analysis to address military capability gaps in and across multiple warfighting domains in the face of accelerating adversary capabilities. In DevSecOps you will be part of a team modernizing our approach to software development and enhancing our security posture.
As the Virtual Warfare Center’s DevSecOps Team Lead you will lead a team of engineers designing, implementing, and monitoring software development infrastructure across multiple networks and physical locations across the United States. You will build and maintain cross-functional relationships with multiple teams to coordinate the selection, approval, deployment, and maintenance of a consistent set of software tools in all locations. Your work will guarantee our development and deployment infrastructure and processes are reliable, efficient, consistent, and secure. Your team will partner with relevant stakeholders to create processes, design cloud-based solutions, support deploying applications in cloud environments, evaluate solution performance and implement enhancements. You will guide the team through the update of a legacy software development infrastructure to use modern technologies including containers, cloud, high performance computing, AI/ML, and automation. This position requires mentoring early-career employees on DevSecOps design, implementation, maintenance, communication, and leadership skills. Your team will track required software updates and drive the process to eliminate known vulnerabilities including monitoring systems, tools, and software packages for security vulnerabilities. You will contribute to a collaborative, cross-functional team managing software security approvals and automate the integration of security into all phases of the software development lifecycle. Your work with an array of software development, IT, and cybersecurity teams will address emergent issues while improving the efficiency and usability of our systems and software products.
Position Responsibilities:
- Lead a team of engineers responsible for designing, installing, configuring, and maintaining a consistent, secure software development toolchain across multiple networks and physical locations.
- Spearhead the approval and implementation of continuous integration and continuous deployment pipelines into collateral secret and program spaces.
- Coordinate between software development, IT, and security teams on vulnerability tracking and mitigation, driving efforts forward.
- Architect and implement the transition of a multi-site, multi-network software development environment into a cloud-based approach.
- Lead trade studies and tool selection to upgrade and modernize software development processes and operational infrastructure.
- Lead implementation of best practices and methodologies for provisioning, platform scaling, configuration management, monitoring and troubleshooting
- Maintain the DevSecOps vision and roadmap, track status, and communicate progress to stakeholders.
- Mentor and coach the team, provide technical leadership, foster a culture of knowledge sharing and continuous learning, and grow their skills.
Basic Qualifications (Required Skills/ Experience):
- Bachelor’s Degree in an engineering discipline or 17+ years equivalent related experience
- 10+ years’ experience with software engineering
- 3+ years’ experience with scripting languages such as Bash or Python
- 3+ years’ experience containerized software development
- 3+ years’ experience supporting DevSecOps lifecycle
- Experience with Agile development practices using continuous integration and deployment
- 3+ years of experience performing automation, implementation and deployments in both Windows and Linux systems
- Active Secret clearance
Preferred Qualifications (Desired Skills/Experience):
- Active Top Secret SCI clearance
- Experience with gitlab
- Experience with Jenkins
- Experience with JIRA
- 3+ years’ experience supporting cloud development environments
- Experience with cloud computing in classified environments
- CompTIA Security+
- Bachelor of Science degree from an accredited course of study in engineering, engineering technology (includes manufacturing engineering technology), chemistry, physics, mathematics, data science, or computer science.
Travel: 10%
Drug Free Workplace:
Boeing is a Drug Free Workplace (DFW) where post offer applicants and employees are subject to testing for marijuana, cocaine, opioids, amphetamines, PCP, and alcohol when criteria is met as outlined in our policies.
CodeVue Coding Challenge:
To be considered for this position you will be required to complete a technical assessment as part of the selection process. Failure to complete the assessment will remove you from consideration.
Pay & Benefits:
At Boeing, we strive to deliver a Total Rewards package that will attract, engage and retain the top talent. Elements of the Total Rewards package include competitive base pay and variable compensation opportunities.
The Boeing Company also provides eligible employees with an opportunity to enroll in a variety of benefit programs, generally including health insurance, flexible spending accounts, health savings accounts, retirement savings plans, life and disability insurance programs, and a number of programs that provide for both paid and unpaid time away from work.
The specific programs and options available to any given employee may vary depending on eligibility factors such as geographic location, date of hire, and the applicability of collective bargaining agreements.
Pay is based upon candidate experience and qualifications, as well as market and business considerations.
Summary Pay Range for Lead: $136,850 - $185,150
Applications for this position will be accepted until Mar. 25, 2026
Export Control Requirements:
This position must meet U.S. export control compliance requirements. To meet U.S. export control compliance requirements, a “U.S. Person” as defined by 22 C.F.R. §120.62 is required. “U.S. Person” includes U.S. Citizen, U.S. National, lawful permanent resident, refugee, or asylee.
Export Control Details:
US based job, US Person required
Relocation
This position offers relocation based on candidate eligibility.
Security Clearance
This position requires an active U.S. Secret Security Clearance (U.S. Citizenship Required). (A U.S. Security Clearance that has been active in the past 24 months is considered active)
Visa Sponsorship
Employer will not sponsor applicants for employment visa status.
Shift
This position is for 1st shift
Equal Opportunity Employer:
Boeing is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, age, physical or mental disability, genetic factors, military/veteran status or other characteristics protected by law.
Business Area:
EngineeringSeniority Level:
AssociateJob Description:
At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world's largest enterprises.
At Cloudera, our Data Services Pillar is the heart of data innovation. We don't just work with technology; we build it. Our mission is to empower data practitioners by creating seamless, enterprise-grade experiences for data engineering, warehousing, streaming, operational databases, and AI.
You will be a key member of the NFQE (Non Functional QE) team that drives the performance reliability of Cloudera's Kuberneteshosted data services. The role blends deep technical knowledge of performance testing, distributed data workloads, and container orchestration with a datadriven mindset. You'll design, automate, run, and analyze performance tests for Cloudera's flagship services, ensuring they meet or exceed customerdefined SLOs/SLAs at scales.
As a Performance Engineer, you will:
Work with internal development teams and the open source community to proactively drive performance improvements/optimizations across our data warehouse and Data Engineering stack.
Work with product managers, developers and the field team to understand performance and scale requirements, and develop benchmarks based on these requirements.
Develop automation to execute benchmarks, collect and aggregate metrics and profiles, and report results, trends, and regressions.
Analyze performance and scalability characteristics to identify bottlenecks in large-scale distributed systems.
Perform root cause analysis of performance issues identified by internal testing and from customers and suggest corrective actions.
Evaluate performance of systems and provide related guidance to the team.
We are excited about you if you have:
3 + years of industry experience in performance-related work, ideally on large-scale distributed systems
Understanding of DBMS algorithms and data structure fundamentals.
Understanding of hardware trends and full-stack systems performance: CPU, RAM, storage, network, Linux kernel, JVM, and distributed systems performance.
Understanding of performance analysis tools and techniques.
Strong design, coding skills, and test automation skills (Java/C++/Golang/Python preferred)
Knowledge of relevant frameworks, cloud provider knowledge, K8s, etc.
Ability to work in a distributed setting with team members spread in multiple geographies
Demonstrated ability to work on large cross-functional projects, including strong written communication skills and a collaborative mindset, as you will be working with many teams inside and outside of Cloudera.
Experience with benchmark and performance test design. You eshould understand basic concepts of performance testing including different types of performance tests (microbenchmarks, end-to-end benchmarks, concurrency and scale testing), how to reduce (or deal with) noise in test results, etc.
Experience designing performance tests that provide useful insights into specific aspects of performance.
Solid understanding of basic performance theory - in particular a very good understanding of latency, throughput, and concurrency and how they relate to each other.
Strong understanding of the types of workloads they'll be testing Ideally they should have specific experience creating performance tests for the specific product area they'll be working on (SQL, ML, etc).
B.S. or M.S. in Computer Science or equivalent experience.
You might also have:
Experience with the Hadoop ecosystem (i.e. Hive, Impala, Spark), in specific Prior work on largescale data lakehouse or datawarehouse performance
Hands-on experience with containerization, Kubernetes, public cloud infrastructure (AWS, Azure and/or GCP) and mesh-networks
Certifications: CKA/CKAD, AWS Solutions Architect, GCP Cloud Architect, Azure Solutions Architect, or equivalent.
Security & Compliance: Experience writing performance tests that also verify dataprivacy and audit compliance (e.g., GDPR, HIPAA).
Why this role matters:
This is your opportunity to build cloud-native solutions that are deployable anywhere whether in massive clusters on any cloud provider or in private data centers. You'll work with cutting-edge technologies like Trino, Spark, Airflow, and advanced AI inferencing systems to shape the future of analytics. Your code will directly influence how data engineers, analysts, and developers worldwide find value in their data.
We believe in the power of open source. You'll collaborate with project committers, contributing upstream to keep technologies like Apache Hive and Impala evolving. You'll harden these engines for rock-solid security, optimize them for peak performance, and make them effortlessly run across all environments. Join us and help build the trusted, cloud-native platform that powers insights for the most data-intensive companies on the planet.
This position is not eligible for sponsorship.
The expected base salary range for this role in:
California is $124,000 - $155,000
The salary will vary depending on your job-related skills, experience and location.
What you can expect from us:
Generous PTO Policy
Support work life balance with Unplugged Days
Flexible WFH Policy
Mental & Physical Wellness programs
Phone and Internet Reimbursement program
Access to Continued Career Development
Comprehensive Benefits and Competitive Packages
Paid Volunteer Time
Employee Resource Groups
EEO/VEVRAA
#LI-SZ1
#LI-HYBRID
HCLTech is looking for a highly talented, self-motivated and Experienced Firmware Test Engineer to join it in advancing the technological world through innovation and creativity.
Job Title: Firmware Test Engineer
Job ID: 55383
Position Type: Fulltime
Location: Auburn Hills, MI
Core Responsibilities
- Design and develop Basic Software (BSW) and SoC‐level components for automotive ECUs.
- Develop, configure, and optimize Board Support Packages (BSPs) for various automotive SoCs.
- Implement, customize, and optimize low-level drivers for communication interfaces such as UART, SPI, I2C, GPIO, and interrupt controllers.
- Perform embedded OS bring‐up (Linux, FreeRTOS, RTOS), including kernel configuration, device trees, and bootloader customization.
- Conduct SoC-level debugging and issue resolution using tools such as JTAG, GDB, oscilloscopes, and logic analyzers.
- Collaborate with cross-functional automotive teams to ensure robust integration with ADAS, Autonomous Driving, IVI, and safety-critical systems.
- Apply embedded security best practices and support implementation of secure boot, encryption, and authentication mechanisms.
- Work within CI/CD pipelines to automate builds, code analysis, testing, and deployment for embedded software.
- Analyze SoC architectures from vendors like Renesas, TI, Intel, Qualcomm, and tailor BSW design to platform specifications.
Required Qualifications
- Bachelor's degree or higher in Computer Science, Electrical Engineering, or related field (Master's preferred).
- 8+ years of experience in BSW and SoC software design for automotive applications.
- Strong proficiency in C/C++ and embedded programming.
- Proven experience in SoC integration, BSP development, and low-level driver implementation.
- Solid understanding of SoC architectures, peripheral interfaces, and device drivers.
- Experience with embedded operating systems such as Linux, FreeRTOS, RTOS.
- Strong familiarity with tools such as Git, Make/CMake, and debugging tools like JTAG, GDB.
- Excellent communication, analytical thinking, and problem‐solving abilities.
- Experience with autonomous driving platforms or In-Vehicle Infotainment (IVI) architecture is a plus.
- Knowledge of embedded system security (authentication, secure boot, access control).
Pay and Benefits
Pay Range Minimum: $71000 per year
Pay Range Maximum: $108000 per year
HCLTech is an equal opportunity employer, committed to providing equal employment opportunities to all applicants and employees regardless of race, religion, sex, color, age, national origin, pregnancy, sexual orientation, physical disability or genetic information, military or veteran status, or any other protected classification, in accordance with federal, state, and/or local law. Should any applicant have concerns about discrimination in the hiring process, they should provide a detailed report of those concerns to for investigation.
Compensation and Benefits
A candidate's pay within the range will depend on their work location, skills, experience, education, and other factors permitted by law. This role may also be eligible for performance-based bonuses subject to company policies. In addition, this role is eligible for the following benefits subject to company policies: medical, dental, vision, pharmacy, life, accidental death & dismemberment, and disability insurance; employee assistance program; 401(k) retirement plan; 10 days of paid time off per year (some positions are eligible for need-based leave with no designated number of leave days per year); and 10 paid holidays per year.
How You'll Grow
At HCLTech, we offer continuous opportunities for you to find your spark and grow with us. We want you to be happy and satisfied with your role and to really learn what type of work sparks your brilliance the best. Throughout your time with us, we offer transparent communication with senior-level employees, learning and career development programs at every level, and opportunities to experiment in different roles or even pivot industries. We believe that you should be in control of your career with unlimited opportunities to find the role that fits you best.
The Technical Project Manager (TPM) has three main responsibilities:
- Project Manage all technical tasks during implementation and upgrades.
- Install and configure servers and the Care Logistics applications in Amazon Web Services (AWS) and on premise.
- Perform technical operations and oversee availability, performance, and supportability of our observability infrastructure.
The TPM acts as the project manager and liaison between Care Logistics and the customer for all technical activities. The TPM is responsible for coordinating the system configuration, sizing, ordering, and installation while technically engineering and managing the integration of Care Logistics solutions. They work closely with Solutions Delivery and customer resources in support of organizational objectives. Solutions Delivery functions include project delivery tasks such as solution sizing, technical project planning, customer guidance, system installation, system validation, system testing, technical training, and support of technical onsite events. The TPM facilitates DevOps functions between development and the Solutions Delivery teams to ensure technical operations are correctly executed, effectively communicated, and continuously improved.
ESSENTIAL RESPONSIBILITIES:
Solutions Delivery Functions
- Delivery components of customer project tasks which include:
- Assist with the design and implementation of new technologies
- Assist with the sizing of customer systems
- Train new employees on all aspects of the role
- Considered a Subject Matter Expert for all aspects of the technology and project delivery
- Install and troubleshoot software, hardware, and services necessary to support Care Logistics solutions
- Lead the engineering of hospital customer’s technical solutions
- Lead, plan, organize and drive the design, testing, and implementation of Care Logistics software solutions and related advisory services
- Educate customer on technical aspects of the Care Logistics system
- Interface with service and hardware system vendors to build and configure systems
- Participate in onsite customer events, including technical go-live
- Technical Operations and Observability:
- Manage alert and monitoring configuration
- Collect, aggregate, and visualize metrics to provide actionable insights
- Advise right-sizing of AWS infrastructure resources to optimize cost and performance
- Manage incident response
- Provide insight to Cloud Center of Excellence
- Additional tasks which include:
- Provide primary technical support for project team members
- Provide Tier 2 level support for Care Logistics Support team
- Create and maintain internal environments for use by Care Logistics Client Engagement team
- Create Knowledge Base articles and other technical documentation for use by Care Logistics employees and customers
- Define and maintain a clear, concise documented process for the implementation and integration of the system
- Collaborate with teammates to troubleshoot and maintain existing application modules
- Participate in DevOps initiatives to improve products and operations
QUALIFICATIONS – EDUCATION, WORK EXPERIENCE, CERTIFICATIONS:
REQUIRED
- Bachelor’s degree in Computer Information Systems or equivalent experience
- PMP certification and/or equivalent experience
- 2-4 years hands on experience using Amazon Web Services (AWS) services such as EC2, RDS, Systems Manager, VPN, CloudWatch
- 2-4 years of monitoring systems experience using tools such as AWS CloudWatch, Datadog, New Relic, SolarWinds, Dynatrace, etc.
- 4-6 years demonstrated project management experience
- Advanced operation and maintenance of Linux (Red Hat Operating System)
- Demonstrated advanced analytical and troubleshooting skills
- 3+ years integrating software/hardware systems in client-server and cloud environments
- Proven organizational and delivery skills
DESIRED
- AWS certification desired
- Automating and configuring Amazon Web Services (AWS) such as EC2, RDS, VPN
- Operational best practices related to systems operation and maintenance in on-premises and AWS production environments
- Industry standard application/applet containers such as Tomcat
- PostgreSQL and Aurora Databases (installation, configuration, and operation)
- Production High availability server environments
- Complex hardware and software installations
- Management of enterprise reporting tools and/or related technologies
- Project delivery, operations, and support using DevOps and/or Agile methods
- Support leadership experience
- Use of ticketing systems such as JIRA and/or related incident management tools such as OpsGenie
- Comprehension of related scientific and technical journals, abstracts, financial reports, and legal documents.
- Preparation of articles, abstracts, editorials, journals, manuals, and critiques.
- Preparation and delivery of comprehensive presentations, participation in formal debate, extemporaneous communication, and professional communication before an audience.
- Professional certifications in related industry skills such as DBMS, CISSP, ITIL, Agile, and Lean are a plus
KNOWLEDGE, SKILLS, AND ABILITIES:
- Develop strong and productive working relationships with others
- Form strong team bonds and enhance team performance
- Strong organizational and quality management skills with ability to handle multiple, competing tasks and priorities
- Cope with rapidly changing information in a fast-paced environment
- Proven communication, interpersonal, analytical, and organizational skills
- Proven ability to properly communicate with customers (in person and via phone) and manage expectations during a project
- Work both independently and as a member of the implementation and support team
- Manage multiple concurrent activities, all with fluctuating deadlines, by working with other departments, both internal and external
- Quickly identify and resolve issues
- Quickly understand complex concepts
- Excellent oral and written communication skills
- Excellent customer management skills
- Above average observational skills to collect data and validate information
- Outstanding analytical skills with the ability to critically evaluate the information gathered from multiple sources, reconcile conflicts, relate high-level information to details, and distinguish user requests from underlying business problems/needs.
- Effectively represent Jackson Healthcare/Care Logistics values and principles in decision-making and actions
- Support leadership and/or project management
- Excellent troubleshooting skills
- Excellent organizational and delivery skills
- Install, configure, and manage hardware and software in AWS and on-premises environments
- Provide specifications for system hardware and AWS service requirements
- Implement complex system solutions involving multiple technologies
- Control and implement complex system and application feature configurations
- Troubleshoot complex system and technical issues
- Read and understand system and application logs
- Proven ability to communicate and teach complex technical concepts to less technical resources
- Excellent communications and interpersonal skills, as well as analytical and problem-solving skills
- Excellent documentation skills
REQUIRED KNOWLEDGE
- Amazon Web Services (AWS) services such as EC2, RDS, Systems Manager, VPN, CloudWatch
- Monitoring systems such as AWS CloudWatch, Datadog, New Relic, SolarWinds, Dynatrace, etc.
- In-depth knowledge of Linux (Red Hat Operating System) concepts and operations in a production environment
- VMware, Web servers, DBMS, Reporting and analytic tools
- Project Management Methodologies
- Advanced PC knowledge including proficiency with MS Outlook, Word, Excel, and PowerPoint
DESIRED KNOWLEDGE
- Knowledge automating and configuring Amazon Web Services (AWS) such as EC2, RDS, VPN
- Understanding of high availability server environments
- Hardware and software installation techniques
- Healthcare Information Systems
- Enterprise reporting tools
- DevOps and Agile methodologies related to project delivery, operations, and support
- Ticketing systems such as JIRA and related incident management tools (such as OpsGenie)
TRAVEL REQUIREMENTS & WORKING CONDITIONS:
- 10-80% travel required
- The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job
- Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions
- While performing the duties of this job, the employee is frequently required to stand; walk; sit; use hands to finger, handle, or feel; write; type; reach with hands and arms; climb or balance; stoop, kneel, crouch, or crawl; talk or hear; and smell
- The employee must frequently lift and/or move up to 50 pounds
- Specific vision abilities required by this job include close vision, distance vision, color vision, peripheral vision, depth perception, and ability to adjust focus
Role: Cybersecurity Engineer III
Location: MD – Silver Spring, DC, or ATL – Techwood - Onsite
Job Description
Job Responsibilities / Typical Day in the Role
• Implement design reviews to evaluate security controls
• Identify and communicate opportunities to enhance the security posture of WBD
• Build and / or manage enterprise security platforms effectively
• Communicate effectively across all levels of management to articulate WBD security goals and vision.
• Identify and communicate opportunities to enhance the security posture of WBD
• Build and / or manage enterprise security platforms effectively (SAAS, on premise or in Cloud)
• Communicate effectively across all levels of management to articulate WBD security goals and vision.
• Have a team player mentality; strive to contribute to team cohesion however can work independently if the need arises
• Plan, design, engineer and implement security-related technologies
• Understanding technical security issues, their implications within WBD business units and able to effectively communicate them to management and other business leaders.
• Configure, troubleshoot, and maintain security infrastructure – including software and hardware in cloud environments, as well as on-premises.
• Conduct security audits and assessments to regularly determine the effectiveness of security platforms and identify areas of improvement.
• Host and operating systems hardening, auditing, monitoring and logging with appropriate security controls and best practices while meeting security best practices and business goals
• Research and explore emerging security technologies and determine their appropriate use within the company.
• Prepare, document, and create standard operating procedures and protocols.
• Crosstrain and mentor other team members as needed
Must Have Skills / Requirements
1) Implementing advanced cyber security technology in a complex environment
a. 5+ years of experience; Hands-on experience in security engineering, hands-on experience in building, designing, and maintaining enterprise security tools.
2) Scripting experience (using Python, Go, or other equivalent languages)
a. 5+ years of experience.
3) Hands-on Experience with automation technologies
a. 3+ Years of experience; Terraform, Ansible, CloudFormation, etc.
4) Linux Experience.
a. 5+ years of experience; Ability to construct and maintain complex network infrastructures.
Technology requirements:
• Engineer and administer security platforms including SIEM/SOAR systems, endpoint detection and response, vulnerability management, anomaly detection, and cloud analysis.
• Experience in managing the Brinqa vulnerability management platform and experience with Groovy programming language
• Must have 5+ years of scripting experience (using Python or other equivalent languages)
• Hands-on Experience in public cloud infrastructures like AWS (Amazon Web Services)
Nice to Have Skills / Preferred Requirements
1) Security and Cloud certifications are a plus. (CISSP, Splunk Admin, AWS Solution architect).
2) Media/entertainment or distributed global network experience.
Soft Skills
1) Hands-on technical experience with networking and computing system architectures, specifically, the security aspects thereof.
2) Thorough understanding of information security principles, techniques, principles, policy frameworks, and best practices
3) Hands-on technical experience with compliance and regulatory frameworks and how they affect architecture designs and review
Education / Certifications
1) None required, but certifications preferred.
Site Reliability Engineer
Description and Requirements
About Our Team
We are building Quantum, a next‑generation hybrid AI platform that spans Windows, Android, and cloud. As part of this initiative, we are growing the reliability engineering organization that powers cross‑device Personal AI.
We are hiring Site Reliability Engineers (SREs) to strengthen the reliability, observability, and operational excellence of Qira’s AI systems across device, edge, and cloud. Depending on your strengths, you may be aligned to areas such as Observability, Operations, or Service Reliability.
Works with the speed and creativity of a startup inside— you’ll help build foundational systems with clarity, ownership, and modern engineering practices.
Location: On-site in Chicago, IL. Hybrid (3 days on-site, 2 days remote)
What You Might Work On
As an SRE, you may be responsible for a subset of the following, depending on team placement and skill alignment:
Reliability & Systems Engineering
- Support the reliability, availability, and performance of distributed systems across cloud, edge, and device environments.
- Help define, measure, and monitor SLIs and SLOs for core services.
- Identify reliability risks and collaborate with senior engineers on mitigation plans.
Operational Excellence
- Participate in on‑call rotations and assist with incident response and post‑incident reviews.
- Contribute improvements to runbooks, automation, and tooling that reduce alert noise and operational toil.
- Help enhance detection, alerting, and response workflows.
Observability & Insight
- Implement and improve telemetry using OpenTelemetry, Grafana, and related tools.
- Build dashboards and tools that improve visibility into system health and AI service behavior.
- Ensure observability data is complete, accurate, and actionable.
Deployments & Change Safety
- Support safe, reliable deployment workflows including canaries, staged rollouts, and automated rollbacks.
- Assist in improving CI/CD systems and deployment tooling.
Collaboration & Best Practices
- Work closely with senior SREs, DevOps engineers, AI/ML teams, and platform engineers.
- Contribute to reliability reviews, operational readiness checks, and cross‑team projects.
- Advocate for modern SRE and DevOps practices within the organization.
Basic Qualifications
- 4+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or production systems operations.
- Bachelor’s Degree in Computer Science, Engineering, or related technical field (or equivalent practical experience).
- Foundational experience supporting distributed systems in production.
- Ability to write scripts or tools in Python, Go, Bash, or similar languages.
- Solid understanding of Linux systems, networking basics, and system performance fundamentals.
- Experience with cloud platforms (Azure preferred, AWS or GCP acceptable).
- Familiarity with monitoring/observability (metrics, logs, tracing).
- Experience with containers and Kubernetes.
Preferred Qualifications
- Experience with OpenTelemetry instrumentation and telemetry pipelines.
- Hands‑on experience with Grafana, Prometheus, Loki, or Tempo.
- Exposure to AI/ML systems, inference services, or data‑intensive workloads.
- Experience contributing to CI/CD processes and deployment automation.
- Familiarity with hybrid architectures spanning device, edge, and cloud.
- Passion for automation, reliability, and operational excellence.
What Success Looks Like
- Systems become easier to operate, observe, and trust.
- Alerts are more accurate and actionable.
- On‑call load decreases through thoughtful automation and improvements.
- Deployment workflows become more reliable and repeatable.
- You grow toward deeper ownership and technical leadership within the reliability engineering organization.
Location: Database Engineer
Duration: 11-12 months
Location : Austin , TX ( 78759) Hybrid role - In office Mon, Wed, Thurs is a must. (No flexibility on these days)
Job Description:
The Cassandra Database Engineer is an expert across NOSQL database technologies, but specifically a specialist on Cassandra database administration.
For this position, NOSQL database expertise is mandatory with a primary focus on Cassandra databases, as well as expertise in Public Cloud technology (AWS and/or GCP).
For this mission, the engineer will primarily be responsible for database operational activities.
Essential Functions / Key Areas of Responsibility
The Database Engineer primary responsibility footprint:
· Database performance analysis and operations review for production database platforms
· Manage database operations activities including incident response, database alert resolution, and managing third party support engagement
· Deploy and maintain database monitoring solutions.
· Test and build database restore and recovery procedures
· Database platform deployment, installation, patching, change management, and third-party software upgrades.
· Responsible for database hardening procedure identification and deployment on public cloud, hosted, and on-premises platforms.
· Responsible for providing database expertise and operations support to the technical support teams and project delivery teams.
· Responsible for participating in database platform review, bench and tuning exercises, security evaluation, provide technical analysis and proactive recommendations for improvements and/or design changes for production platforms
Minimum Requirements: Skills, Experience & Education
· HS diploma with 8+ experience in Cassandra administration (NOT architecture or design)
· College degree in Computer Science preferred + 8-10 years’ experience
· NOSQL Database: 8-10 years Cassandra administration
· Extensive background with public cloud database deployment, management and migration.
· Expertise in database concepts, defining standards, processes, and procedures in database deployment methodologies
· Expert in operations of high-profile production database platforms with high SLA and high-performance expectation
· High level of experience in managing change on production database platform on hosted, on premise, and cloud database platforms
· Expert in deploying high availability database architectures
· Proactive, team player, and leadership qualities with strong technical background
· Excellent verbal and written communication skills
Preferred Qualifications
· Highly skilled in Cassandra database administration
· DataStax enterprise Cassandra administration a plus
· Strong production operations and troubleshooting skills
· Linux operating system background
· Skilled in Public Cloud deployment methods/tools (Gitlab, Terraform, Datadog)
· Knowledge of Kubernetes and Docker.
· Database performance evaluation and platform bench participation
Special Position Requirements:
Candidate will need to be able to multitask and quickly switch if needed to work on emergency incidents on production platforms. The position requires the ability to be able to manage tight deadlines and have visibility on project delivery goals and the ability to communicate effectively to project teams and management. The candidate will be able to thrive in fast paced work environment.
- Looking for a candidate that is currently in the position of maintaining Cassandra clusters today (avoid those that have worked in past, or a couple years ago...)
- How many clusters are maintained today
- How many nodes
- What Cassandra version are they
- How many years have you worked on Cassandra (ideally 5+)
- Candidate has operations experience and can speak to challenges in his environment today
- manages patching / upgrades
- is called upon in crisis to manage
- delivers new environments
- Performance tuning experience with Cassandra
- familiar with backup and recovery
- Familiar with monitoring Cassandra (Prometheus or Datadog a plus)
- is go to for other teams on Cassandra database topics
- Candidate is adaptable to work in fast paced environment, context switching is normal
- Candidate is ok to be in stressful/challenging situations
- Outages
- Crises team
- War room
Position Summary:
The Scientific Computing and Data group at the Icahn School of Medicine at Mount Sinai partners with scientists to accelerate scientific discovery. To achieve these aims, we support a cutting-edge high-performance computing and data ecosystem along with MD/PhD-level support for researchers. The group is composed of a high-performance computing team, a clinical data warehouse team and a data services team.
The Lead HPC Architect, Cybersecurity, High Performance Computational and Data Ecosystem, is responsible for designing, implementing, and managing the cybersecurity infrastructure and technical operations of Scientific Computing’s computational and data science ecosystem. This ecosystem includes a 25,000+ core and 40+ petabyte usable high-performance computing (HPC) systems, clinical research databases, and a software development infrastructure for local and national projects. The HPC system is the fastest in the world at any academic biomedical center (Top 500 list).
To meet Sinai’s scientific and clinical goals, the Lead brings a strategic, tactical and customer-focused vision to evolve the ecosystem to be continually more resilient, secure, scalable and productive for basic and translational biomedical research. The Lead combines deep technical expertise in cybersecurity, HPC systems, storage, networking, and software infrastructure with a strong focus on service, collaboration, and strategic planning for researchers and clinicians throughout the organization and beyond. The Lead is an expert troubleshooter, productive partner and leader of projects. The lead will work with stakeholders to make sure the HPC infrastructure is in compliance with governmental funding agency requirements and to promote efficient resource utilizations for researchers
This position reports to the Director for HPC and Data Ecosystem in Scientific Computing and Data.
Key Responsibilities:
HPC Cybersecurity & System Administration:
- Design, implement, and manage all cybersecurity operations within the HPC environment, ensuring alignment with industry standards (NIST, ISO, GDPR, HIPAA, CMMC, NYC Cyber Command, etc.).
- Implement best practices for data security, including but not limited to encryption (at rest, in transit, and in use), audit logging, access control, authentication control, configuration managements, secure enclaves, and confidential computing.
- Perform full-spectrum HPC system administration: installation, monitoring, maintenance, usage reporting, troubleshooting, backup and performance tuning across HPC applications, web service, database, job scheduler, networking, storage, computes, and hardware to optimize workload efficiency.
- Lead resolution of complex cybersecurity and system issues; provide mentorship and technical guidance to team members.
- Ensure that all designs and implementations meet cybersecurity, performance, scalability, and reliability goals. Ensure that the design and operation of the HPC ecosystem is productive for research.
- Lead the integration of HPC resources with laboratory equipment for data ingestion aligned with all regulatory such as genomic sequencers, microscopy, clinical system etc.
- Develop, review and maintain security policies, risk assessments, and compliance documentation accurately and efficiently.
- Collaborate with institutional IT, compliance, and research teams to ensure all regulatory, Sinai Policy and operational alignment.
- Design and implement hybrid and cloud-integrated HPC solutions using on-premise and public cloud resources.
- Partner with other peers regionally, nationally and internationally to discover, propose and deploy a world-class research infrastructure for Mount Sinai.
- Stay current with emerging HPC, cloud, and cybersecurity technologies to keep the organization’s infrastructure up-to-date.
- Work collaboratively, effectively and productively with other team members within the group and across Mount Sinai.
- Provide after-hours support as needed.
- Perform other duties as assigned or requested.
Requirements:
- Bachelor’s degree in computer science, engineering or another scientific field. Master's or PhD preferred.
- 10 years of progressive HPC system administration experience with Enterprise Linux releases including RedHat/CentOS/Rocky Systems, and batch cluster environment.
- Experience with all aspects of high-throughput HPC including schedulers (LSF or Slurm), networking (Infiniband/Gigabit Ethernet), parallel file systems and storage, configuration management systems (xCAT, Puppet and/or Ansible), etc.
- Proficient in cybersecurity processes, posture, regulations, approaches, protocols, firewalls, data protection in a regulated environment (e.g. finance, healthcare).
- In-depth knowledge HIPAA, NIST, FISMA, GDPR and related compliance standards, with prove experience building and maintaining compliant HPC system
- Experience with secure enclaves and confidential computing.
- Proven ability to provide mentorship and technical leadership to team members.
- Proven ability to lead complex projects to completion in collaborative, interdisciplinary settings with minimum guidance.
- Excellent analytical ability and troubleshooting skills.
- Excellent communication, documentation, collaboration and interpersonal skills. Must be a team player and customer focused.
- Scripting and programming experience.
Preferred Experience
- Proficient with cloud services, orchestration tools, openshift/Kubernetes cost optimization and hybrid HPC architectures.
- Experience with Azure, AWS or Google cloud services.
- Experience with LSF job scheduler and GPFS Spectrum Scale.
- Experience in a healthcare environment.
- Experience in a research environment is highly preferred.
- Experience with software that enables privacy-preserving linking of PHI.
- Experience with Globus data transfer.
- Experience with Web service, SAP HANA, Oracle, SQL, MariaDB and other database technologies.
Strength through Unity and Inclusion
The Mount Sinai Health System is committed to fostering an environment where everyone can contribute to excellence. We share a common dedication to delivering outstanding patient care. When you join us, you become part of Mount Sinai’s unparalleled legacy of achievement, education, and innovation as we work together to transform healthcare. We encourage all team members to actively participate in creating a culture that ensures fair access to opportunities, promotes inclusive practices, and supports the success of every individual.
At Mount Sinai, our leaders are committed to fostering a workplace where all employees feel valued, respected, and empowered to grow. We strive to create an environment where collaboration, fairness, and continuous learning drive positive change, improving the well-being of our staff, patients, and organization. Our leaders are expected to challenge outdated practices, promote a culture of respect, and work toward meaningful improvements that enhance patient care and workplace experiences. We are dedicated to building a supportive and welcoming environment where everyone has the opportunity to thrive and advance professionally. Explore this opportunity and be part of the next chapter in our history.
About the Mount Sinai Health System:
Mount Sinai Health System is one of the largest academic medical systems in the New York metro area, with more than 48,000 employees working across eight hospitals, more than 400 outpatient practices, more than 300 labs, a school of nursing, and a leading school of medicine and graduate education. Mount Sinai advances health for all people, everywhere, by taking on the most complex health care challenges of our time — discovering and applying new scientific learning and knowledge; developing safer, more effective treatments; educating the next generation of medical leaders and innovators; and supporting local communities by delivering high-quality care to all who need it. Through the integration of its hospitals, labs, and schools, Mount Sinai offers comprehensive health care solutions from birth through geriatrics, leveraging innovative approaches such as artificial intelligence and informatics while keeping patients’ medical and emotional needs at the center of all treatment. The Health System includes more than 9,000 primary and specialty care physicians; 13 joint-venture outpatient surgery centers throughout the five boroughs of New York City, Westchester, Long Island, and Florida; and more than 30 affiliated community health centers. We are consistently ranked by U.S. News & World Report's Best Hospitals, receiving high "Honor Roll" status.
Equal Opportunity Employer
The Mount Sinai Health System is an equal opportunity employer, complying with all applicable federal civil rights laws. We do not discriminate, exclude, or treat individuals differently based on race, color, national origin, age, religion, disability, sex, sexual orientation, gender, veteran status, or any other characteristic protected by law. We are deeply committed to fostering an environment where all faculty, staff, students, trainees, patients, visitors, and the communities we serve feel respected and supported. Our goal is to create a healthcare and learning institution that actively works to remove barriers, address challenges, and promote fairness in all aspects of our organization.