Alibaba Cloud Linux 3 Centos Jobs in Usa
15,716 positions found — Page 2
How will you make an impact?
- Jabil is seeking a Software Test Development Engineer who will directly contribute to the transformative growth within our Global cloud Test Development (GCTD) team in the Cloud Enterprise and Intelligent Infrastructure (CE&I) division by applying unique and innovative approaches to solving problems within a large-scale software production environment.
- As the Software Test Development Engineer plays a vital role in ensuring the quality and reliability of hardware products, contributing to the overall success of the manufacturing process and customer satisfaction.
- You will be responsible for contributing to the end-to-end architecture, definition, development and production deployment of production software applications and infrastructure spanning multiple customers and manufacturing regions.
- You will also be responsible for interfacing with internal engineering, manufacturing and quality teams and our end customers to ensure your software deliverables meet the rigorous standards of Jabil’s world-class manufacturing environments.
What will you do?
Test System Development:
- Design and develop test systems and procedures for manufacturing processes. This includes creating test plans, test cases, and test scripts to assess the functionality and performance of hardware components or devices such as:
- Motherboard,
- Memory,
- CPU,
- Storage (SSD, HDD, NVMe) and
- PCIE devices (NIC, GPU, Mezz cards, RAID cards)
Software Development Test:
- Create, validate, release, and maintain test software and scripts that automate the testing process.
- This software may include code for controlling test equipment, collecting and analyzing data, and generating test reports.
Sustaining Test:
- Support and maintenance for the manufacturing server (L10) and rack (L11) level test software and infrastructure deployed at our production facilities, including the implementation of minor system configuration changes (new IPNs).
Test Infrastructure Expansions:
- Support the site’s manufacturing server (L10) and rack (L11) current test infrastructure as well as future expansions planning, deployments, and assembly.
Debugging and Troubleshooting:
- Diagnose and resolve issues with test software, or hardware components (servers, switches, racks, L10, L12) that may arise during the manufacturing process.
Documentation:
- Maintain comprehensive manufacturing server (L10) and rack (L11) documentation of test procedures, specifications, and Infrastructure.
Collaboration:
- Work closely with cross-functional teams, including hardware engineers, manufacturing engineers, and quality assurance personnel, to ensure alignment of testing requirements and quality standards.
Continuous Learning:
- Stay updated on the latest advancements in testing technologies, methodologies, and industry best practices to keep manufacturing processes competitive and up to date.
- Definition and collaboration on overall test infrastructure and application architectures.
Management & Supervisory Responsibilities
- Reports to Management
Education:
- BS degree in Electrical/Computer Engineering, Computer Science or related field. MS preferred
Experience:
- 5 years’ experience in software manufacturing test development/sustaining with enterprise servers, storage or networking products.
- Experience in the following programming/scripting languages:
- Python,
- Java,
- BASH,
- C, C++, experience a plus
- Linux development experience with a solid understanding of its fundamentals:
- CentOS
- Ubuntu
- Experience with hardware and API solutions for controlling, managing and stressing L10 devices (servers, network and storage SSDs, NVMe):
- IPMI,
- Redfish,
- mprime,
- FIO,
- Linpack,
- ptugen,
- memtester
- Familiarity in the creation and configuration (DHCP, PXE boot, nginx) of Virtual Machines (VMs) using VMWare.
- Expertise with leading edge networking systems, hardware, software and protocols including but not limited to enterprise ethernet datacenter switching/routing L1, L2, and L3 (BGP, DHCP Relay, ECMP)
- Arista CloudVision is a plus.
- Experience with code versioning tools (Git preferred).
- Knowledge of professional software engineering practices for the complete software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
- Excellent verbal and written communication skills.
- Experience working in multi-site and multi-cultural environments.
What Can Jabil Offer You?
Along with growth, stability, and the opportunity to be challenged, Jabil offers a competitive benefits package that includes:
- Medical, Dental, Prescription Drug, and Vision Insurance with HRA and HSA options
- 401K Match
- Employee Stock Purchase Plan
- Paid Time Off
- Tuition Reimbursement
- Life, AD&D, and Disability Insurance
- Commuter Benefits
- Employee Assistance Program
- Pet Insurance
- Adoption Assistance
- Annual Merit Increases
- Community Volunteer Opportunities
How will you make an impact?
- As a Site Reliability Technician within Jabil’s Cloud Test Software Development team, you will directly contribute to the daily operations and development of our Cloud Test Platform Infrastructure deployed at multiple production facilities worldwide.
Working Hours:
- 12 HOUR SHIFT
- 6 AM TO 6 PM
- Sunday, Monday, Tuesday, Alternating Wednesdays
What will you do?
- As the Site Reliability Technician, you will provide the first line response to production issues including but not limited to outages, end user performance, change management, monitoring, improving the efficiency and usability of production test infrastructure and applications, and ensuring all site test infrastructure software and hardware is maintained with the latest updates to ensure high levels of performance and reliability.
How will you get here?
- Sustaining support and maintenance for the manufacturing server (L10) and rack (L11-L12) level test software and infrastructure deployed at our production facilities.
- Support the site’s manufacturing server (L10) and rack (L11-L12) current test infrastructure as well as future expansion planning, deployments, and assembly.
- Maintain manufacturing server (L10) and rack (L11-L12) test infrastructure documentation of installations, upgrades, and management.
- Communicate manufacturing test infrastructure enhancements while providing insights based on site operations and uptime challenges.
- Support manufacturing test incident response, analysis, and corrective actions for the site operations.
- Participate in closed loop analysis/responses to factory test failures.
- Perform scheduled preventive maintenance on the test infrastructure, including MDF, IDF, and SUT TORs
Experience:
- Experience in the following programming/scripting languages:
- Python,
- Java,
- BASH,
- C, C++, experience a plus
- Understanding of Linux fundamentals:
- CentOS
- Ubuntu
- Familiarity with hardware and API solutions for controlling, managing and stressing L10 devices (servers, network and storage SSDs, NVMe):
- IPMI,
- Redfish,
- mprime,
- FIO,
- Linpack,
- ptugen,
- memtester
- Familiarity in the creation and configuration (DHCP, PXE boot, nginx) of Virtual Machines (VMs) using VMWare is a plus.
- Experience with leading edge networking systems, hardware, software, and protocols including but not limited to enterprise ethernet datacenter switching/routing L1, L2, and L3 (BGP, DHCP Relay, ECMP). Arista CloudVision is a plus.
- Experience with networking systems, hardware, software, and protocols including but not limited to enterprise ethernet datacenter switching/routing (L1 – L3). · Demonstrated systematic problem-solving capability, coupled with effective communication skills and a sense of ownership and drive.
What Can Jabil Offer You?
Along with growth, stability, and the opportunity to be challenged, Jabil offers a competitive benefits package that includes:
- Medical, Dental, Prescription Drug, and Vision Insurance with HRA and HSA options
- 401K Match
- Employee Stock Purchase Plan
- Paid Time Off
- Tuition Reimbursement
- Life, AD&D, and Disability Insurance
- Commuter Benefits
- Employee Assistance Program
- Pet Insurance
- Adoption Assistance
- Annual Merit Increases
- Community Volunteer Opportunities
How will you make an impact?
- Jabil is seeking a Sr. Manufacturing Cloud Test Development Engineer who will directly contribute to the transformative growth within our Enterprise and Infrastructure division by applying unique and innovative approaches to solving problems within a large-scale software production environment.
- The Software Test Development Engineer plays a vital role in ensuring the quality and reliability of hardware products, contributing to the overall success of the manufacturing process and customer satisfaction.
- You will be responsible for contributing to the end-to-end architecture, definition, development and production deployment of production software applications and infrastructure spanning multiple customers and manufacturing regions.
- As the Sr. Manufacturing Cloud Test Development Engineer, you will also be responsible for interfacing internal engineering, manufacturing and quality teams and our end customers to ensure your software deliverables meet the rigorous standards of Jabil’s world-class manufacturing environments.
What will you do?
Test System Development:
- Design and develop test systems and procedures for manufacturing processes. This includes creating test plans, test cases, and test scripts to assess the functionality and performance of hardware components or devices such as
- motherboard,
- memory,
- CPU, storage (SSD, HDD, NVMe) and
- PCIE devices (NIC, GPU, Mezz cards, RAID cards)
Test Software Development:
- Create, validate, release, and maintain test software and scripts that automate the testing process. This software may include code for controlling test equipment, collecting, and analyzing data, and generating test reports.
Sustaining Test:
- Support and maintenance for the manufacturing server (L10) and rack (L11) level test software and infrastructure deployed at our production facilities, including the implementation of minor system configuration changes (new IPNs).
Documentation:
- Maintain comprehensive manufacturing server (L10) and rack (L11) documentation of test procedures, specifications, and Infrastructure.
Collaboration:
- Work closely with cross-functional teams, including hardware engineers, manufacturing engineers, and quality assurance personnel, to ensure alignment on testing requirements and quality standards.
Continuous Learning:
- Stay updated on the latest advancements in testing technologies, methodologies, and industry best practices to keep manufacturing processes competitive and up to date.
- Definition and collaboration on overall test infrastructure and application architectures.
Management & Supervisory Responsibilities
- Reports to Management
How will you get here?
- Expertise in the following programming/scripting languages:
- Python,
- BASH,
- C, C++, experience a plus
- Linux development expertise with a solid understanding of its fundamentals:
- CentOS
- Ubuntu
- Expertise with hardware and API solutions for controlling, managing, and stressing L10 devices (servers, network, and storage SSDs, NVMe):
- IPMI,
- Redfish,
- mprime,
- FIO,
- Linpack,
- ptugen,
- memtester
- Expertise in the creation and configuration (DHCP, PXE boot, nginx) of Virtual Machines (VMs), VMWare preferred.
- Expertise with leading edge networking systems, hardware, software, and protocols including but not limited to enterprise ethernet datacenter switching/routing L1, L2, and L3 (BGP, DHCP Relay, ECMP). Arista CloudVision is a plus.
- Experience with code versioning tools (Git preferred).
- Strong knowledge of professional software engineering practices for the complete software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
Education:
- BS degree in Electrical/Computer Engineering, Computer Science, or related field. MS preferred.
Experience:
- 5-8 years’ experience in software manufacturing test development/sustaining with enterprise server, storage, or networking products.
- Excellent verbal and written communication skills.
- Experience working in multi-site and multi-cultural environments.
- Domestic and/or international travel, up to 10%, may be required.
What Can Jabil Offer You?
Along with growth, stability, and the opportunity to be challenged, Jabil offers a competitive benefits package that includes:
- Medical, Dental, Prescription Drug, and Vision Insurance with HRA and HSA options
- 401K Match
- Employee Stock Purchase Plan
- Paid Time Off
- Tuition Reimbursement
- Life, AD&D, and Disability Insurance
- Commuter Benefits
- Employee Assistance Program
- Pet Insurance
- Adoption Assistance
- Annual Merit Increases
- Community Volunteer Opportunities
How will you make an impact?
- Jabil is seeking a Sr. Manufacturing Cloud Test Development Engineer who will directly contribute to the transformative growth within our Enterprise and Infrastructure division by applying unique and innovative approaches to solving problems within a large-scale software production environment.
- The Software Test Development Engineer plays a vital role in ensuring the quality and reliability of hardware products, contributing to the overall success of the manufacturing process and customer satisfaction.
- You will be responsible for contributing to the end-to-end architecture, definition, development and production deployment of production software applications and infrastructure spanning multiple customers and manufacturing regions.
- As the Sr. Manufacturing Cloud Test Development Engineer, you will also be responsible for interfacing internal engineering, manufacturing and quality teams and our end customers to ensure your software deliverables meet the rigorous standards of Jabil’s world-class manufacturing environments.
What will you do?
Test System Development:
- Design and develop test systems and procedures for manufacturing processes. This includes creating test plans, test cases, and test scripts to assess the functionality and performance of hardware components or devices such as
- motherboard,
- memory,
- CPU, storage (SSD, HDD, NVMe) and
- PCIE devices (NIC, GPU, Mezz cards, RAID cards)
Test Software Development:
- Create, validate, release, and maintain test software and scripts that automate the testing process. This software may include code for controlling test equipment, collecting, and analyzing data, and generating test reports.
Sustaining Test:
- Support and maintenance for the manufacturing server (L10) and rack (L11) level test software and infrastructure deployed at our production facilities, including the implementation of minor system configuration changes (new IPNs).
Documentation:
- Maintain comprehensive manufacturing server (L10) and rack (L11) documentation of test procedures, specifications, and Infrastructure.
Collaboration:
- Work closely with cross-functional teams, including hardware engineers, manufacturing engineers, and quality assurance personnel, to ensure alignment on testing requirements and quality standards.
Continuous Learning:
- Stay updated on the latest advancements in testing technologies, methodologies, and industry best practices to keep manufacturing processes competitive and up to date.
- Definition and collaboration on overall test infrastructure and application architectures.
Management & Supervisory Responsibilities
- Reports to Management
How will you get here?
- Expertise in the following programming/scripting languages:
- Python,
- BASH,
- C, C++, experience a plus
- Linux development expertise with a solid understanding of its fundamentals:
- CentOS
- Ubuntu
- Expertise with hardware and API solutions for controlling, managing, and stressing L10 devices (servers, network, and storage SSDs, NVMe):
- IPMI,
- Redfish,
- mprime,
- FIO,
- Linpack,
- ptugen,
- memtester
- Expertise in the creation and configuration (DHCP, PXE boot, nginx) of Virtual Machines (VMs), VMWare preferred.
- Expertise with leading edge networking systems, hardware, software, and protocols including but not limited to enterprise ethernet datacenter switching/routing L1, L2, and L3 (BGP, DHCP Relay, ECMP). Arista CloudVision is a plus.
- Experience with code versioning tools (Git preferred).
- Strong knowledge of professional software engineering practices for the complete software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
Education:
- BS degree in Electrical/Computer Engineering, Computer Science, or related field. MS preferred.
Experience:
- 5-8 years’ experience in software manufacturing test development/sustaining with enterprise server, storage, or networking products.
- Excellent verbal and written communication skills.
- Experience working in multi-site and multi-cultural environments.
- Domestic and/or international travel, up to 10%, may be required.
What Can Jabil Offer You?
Along with growth, stability, and the opportunity to be challenged, Jabil offers a competitive benefits package that includes:
- Medical, Dental, Prescription Drug, and Vision Insurance with HRA and HSA options
- 401K Match
- Employee Stock Purchase Plan
- Paid Time Off
- Tuition Reimbursement
- Life, AD&D, and Disability Insurance
- Commuter Benefits
- Employee Assistance Program
- Pet Insurance
- Adoption Assistance
- Annual Merit Increases
- Community Volunteer Opportunities
Jabil is a product solutions company providing comprehensive design, manufacturing, supply chain and product management services. Operating from over 100 facilities in 29 countries, Jabil delivers innovative, integrated, and tailored solutions to customers across a broad range of industries and end-markets, such as automotive, consumer lifestyle and wearable tech, defense and aerospace, connected home and building, industrial and energy, enterprise and infrastructure, healthcare, mobility, packaging and printing.
How will you make an impact?
Jabil is seeking a Software Test Development Engineer who will directly contribute to the transformative growth within our Global cloud Test Development (GCTD) team in the Cloud Enterprise and Intelligent Infrastructure (CE&I) division by applying unique and innovative approaches to solving problems within a large-scale software production environment.
As the Software Test Development Engineer plays a vital role in ensuring the quality and reliability of hardware products, contributing to the overall success of the manufacturing process and customer satisfaction.
You will be responsible for contributing to the end-to-end architecture, definition, development and production deployment of production software applications and infrastructure spanning multiple customers and manufacturing regions.
You will also be responsible for interfacing with internal engineering, manufacturing and quality teams and our end customers to ensure your software deliverables meet the rigorous standards of Jabil’s world-class manufacturing environments.
What will you do?
Test System Development:
Design and develop test systems and procedures for manufacturing processes. This includes creating test plans, test cases, and test scripts to assess the functionality and performance of hardware components or devices such as:
Motherboard,
Memory,
CPU,
Storage (SSD, HDD, NVMe) and
PCIE devices (NIC, GPU, Mezz cards, RAID cards)
Software Development Test:
Create, validate, release, and maintain test software and scripts that automate the testing process.
This software may include code for controlling test equipment, collecting and analyzing data, and generating test reports.
Sustaining Test:
Support and maintenance for the manufacturing server (L10) and rack (L11) level test software and infrastructure deployed at our production facilities, including the implementation of minor system configuration changes (new IPNs).
Test Infrastructure Expansions:
Support the site’s manufacturing server (L10) and rack (L11) current test infrastructure as well as future expansions planning, deployments, and assembly.
Debugging and Troubleshooting:
Diagnose and resolve issues with test software, or hardware components (servers, switches, racks, L10, L12) that may arise during the manufacturing process.
Documentation:
Maintain comprehensive manufacturing server (L10) and rack (L11) documentation of test procedures, specifications, and Infrastructure.
Collaboration:
Work closely with cross-functional teams, including hardware engineers, manufacturing engineers, and quality assurance personnel, to ensure alignment of testing requirements and quality standards.
Continuous Learning:
Stay updated on the latest advancements in testing technologies, methodologies, and industry best practices to keep manufacturing processes competitive and up to date.
Definition and collaboration on overall test infrastructure and application architectures.
Management & Supervisory Responsibilities
Reports to Management
Education:
BS degree in Electrical/Computer Engineering, Computer Science or related field. MS preferred
Experience:
5 years’ experience in software manufacturing test development/sustaining with enterprise servers, storage or networking products.
Experience in the following programming/scripting languages:
Python,
Java,
BASH,
C, C++, experience a plus
Linux development experience with a solid understanding of its fundamentals:
CentOS
Ubuntu
Experience with hardware and API solutions for controlling, managing and stressing L10 devices (servers, network and storage SSDs, NVMe):
IPMI,
Redfish,
mprime,
FIO,
Linpack,
ptugen,
memtester
Familiarity in the creation and configuration (DHCP, PXE boot, nginx) of Virtual Machines (VMs) using VMWare.
Expertise with leading edge networking systems, hardware, software and protocols including but not limited to enterprise ethernet datacenter switching/routing L1, L2, and L3 (BGP, DHCP Relay, ECMP)
Arista CloudVision is a plus.
Experience with code versioning tools (Git preferred).
Knowledge of professional software engineering practices for the complete software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
Excellent verbal and written communication skills.
Experience working in multi-site and multi-cultural environments.
What Can Jabil Offer You?
Along with growth, stability, and the opportunity to be challenged, Jabil offers a competitive benefits package that includes:
Medical, Dental, Prescription Drug, and Vision Insurance with HRA and HSA options
401K Match
Employee Stock Purchase Plan
Paid Time Off
Tuition Reimbursement
Life, AD&D, and Disability Insurance
Commuter Benefits
Employee Assistance Program
Pet Insurance
Adoption Assistance
Annual Merit Increases
Community Volunteer Opportunities
Compensation: $150-195k Responsibilities: • Design, deploy, and manage container orchestration platforms using OpenShift and AKS.
• Administer and optimize Linux-based systems in hybrid and multi-cloud environments.
• Automate infrastructure provisioning and configuration using Ansible Automation Platform.
• Develop and maintain Infrastructure as Code (IaC) using Terraform, Helm, and GitOps workflows.
• Collaborate with DevOps and application teams to implement CI/CD pipelines and DevSecOps practices.
• Monitor system performance, troubleshoot issues, and ensure high availability and disaster recovery.
• Implement security best practices for containerized workloads and cloud environments.
• Provide technical leadership and mentorship to junior engineers.
• Stay current with emerging technologies and contribute to strategic cloud initiatives.
• Assist with migrations to cloud, ensuring best practices are followed and architecture is compliant with company standards.
Qualifications: Required: • Bachelor's degree in computer science, Engineering, or related field (or equivalent experience).
• 5+ years of professional experience in Linux system administration and cloud engineering.
• 3+ years of hands-on experience with OpenShift and AKS in production environments.
• Strong proficiency in scripting languages (e.g., Bash, Python).
• Experience with CI/CD tools (e.g., Jenkins, GitLab CI, ArgoCD).
• Deep understanding of Kubernetes architecture, networking, and security.
• Familiarity with cloud platforms (Azure, AWS, GCP) and hybrid cloud strategies.
• Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK stack).
• Excellent problem-solving and communication skills.
• Linux Administration: Deep expertise in RHEL environment.
• Container Platforms: 3+ years of hands-on experience with OpenShift and AKS.
• Automation: Proficiency with Ansible, Ansible Tower/AAP, and scripting (Bash, Python).
• Infrastructure as Code: Experience with Terraform, Helm, and GitOps tools (e.g., ArgoCD, Flux).
• CI/CD: Familiarity with Jenkins, GitLab CI, Azure DevOps, or similar tools.
• Cloud Platforms: Strong knowledge of Azure, with exposure to AWS or GCP a plus.
• Monitoring & Logging: Experience with Prometheus, Grafana, ELK/EFK, and Azure Monitor.
• Security: Understanding of container security, RBAC, network policies, and compliance frameworks.
• Networking: Solid grasp of Kubernetes networking, service mesh (e.g., Istio), and ingress controllers.
Preferred: • Red Hat Certified Specialist in OpenShift Administration.
• Microsoft Certified: Azure Kubernetes Service Specialist.
• Experience with service mesh technologies (e.g., Istio, Linkerd).
• Experience in regulated industries (e.g., finance, healthcare) is a plus.
Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.
About This Role:
Crusoe Cloud is revolutionizing high-performance computing by offering sustainable, low-cost GPU compute power. As a Senior Cloud Support Engineer, you'll play a crucial role in empowering our customers to leverage this technology for groundbreaking advancements in fields like AI/ML, physics simulations, and computational biology. You will be the primary point of contact for technical support, ensuring our customers can seamlessly utilize Crusoe Cloud to achieve their goals. This role directly impacts Crusoe's mission by enabling our customers to accelerate their research and development, contributing to a more sustainable future. You will be involved in exciting projects, working with cutting-edge technologies and collaborating with a talented team to solve complex challenges. The ideal candidate is a highly motivated and experienced technical professional with a passion for customer success, a deep understanding of cloud technologies, and a commitment to Crusoe's values. This is a full-time position.
What You’ll Be Working On:
- Customer Support: Provide exceptional technical support to customers via Zendesk, meeting SLAs and maintaining high CSAT (95%+).
- On-Call Rotation: Participate in a 24/7 on-call rotation to ensure timely resolution of critical issues.
- Troubleshooting: Diagnose and resolve issues related to VMs, hardware failures, and scaling tests using CLI and internal tools.
- Alert Triage and Maintenance: Manage alert triage, prepare for maintenance windows, and conduct node delivery testing.
- Collaboration: Work closely with SRE, Networking, and Storage teams from initial triage to root cause analysis (RCA) delivery.
- Global Teamwork: Adhere to global team collaboration and handoff processes for ticketing and on-call procedures.
- Knowledge Sharing: Develop onboarding/training materials, knowledge base documentation, and standard operating procedures (SOPs).
What You’ll Bring to the Team:
- Education/Experience: Bachelor's degree in IT, Computer Science, Engineering, or a related field, or 4+ years of equivalent technical experience.
- Linux Proficiency: Strong command-line interface (CLI) skills in Linux environments.
- Version Control: Proficiency with Git for code management and collaboration.
- Customer Support Experience: 5+ years of experience in a customer support role, ideally within cloud, storage, or networking environments.
- Cloud Technologies: Experience with container orchestration (e.g., Kubernetes), workload management (e.g., Slurm, Terraform), and monitoring tools (e.g., Grafana).
- Public Cloud Knowledge: Familiarity with other public cloud platforms (e.g., AWS, Azure, GCP).
- Communication Skills: Excellent communication and customer service skills, including the ability to prioritize competing escalations.
- HPC Knowledge: Understanding of HPC technologies such as Infiniband, RDMA, RoCE, and Software Defined Networking (SDN).
Bonus Points:
- Certifications: CKA, CKAD, CKS, KCNA, AWS Machine Learning - Specialty, Data Analytics - Specialty, Solutions Architect - Professional, Developer - Associate, NVIDIA AI Infrastructure and Operations, Generative AI and LLMs, Generative AI Multi-modal, Infiniband, Linux Foundation IT Associate, System Administrator.
- Cloud Expertise: Deep understanding of specific cloud platforms and services.
- Automation Skills: Experience with automation tools and scripting languages.
- Problem-Solving Abilities: Demonstrated ability to analyze complex technical issues and develop effective solutions.
- Collaboration and Mentorship: Proven ability to mentor, train, and onboard colleagues.
- Passion for Sustainability: A strong interest in contributing to a more sustainable future through technology.
Benefits:
- Industry competitive pay
- Restricted Stock Units in a fast growing, well-funded technology company
- Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
- Employer contributions to HSA accounts
- Paid Parental Leave
- Paid life insurance, short-term and long-term disability
- Teladoc
- 401(k) with a 100% match up to 4% of salary
- Generous paid time off and holiday schedule
- Cell phone reimbursement
- Tuition reimbursement
- Subscription to the Calm app
- MetLife Legal
- Company paid commuter benefit; $300 per pay period
Compensation:
Compensation will be paid between $125,000 and $151,000 + Bonus. Restricted Stock Units are included in all offers. Salary will be determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
Role: Cyber Security Architect – Linux, Ansible & Terraform
Location: Silver Spring, MD , DC, Techwood, ATL – Onsite
Job Responsibilities / Typical Day in the Role
• Implement design reviews to evaluate security controls
• Identify and communicate opportunities to enhance the security posture of WBD
• Build and / or manage enterprise security platforms effectively
• Communicate effectively across all levels of management to articulate WBD security goals and vision.
• Identify and communicate opportunities to enhance the security posture of WBD
• Build and / or manage enterprise security platforms effectively (SAAS, on premise or in Cloud)
• Communicate effectively across all levels of management to articulate WBD security goals and vision.
• Have a team player mentality; strive to contribute to team cohesion however can work independently if the need arises
• Plan, design, engineer and implement security-related technologies
• Understanding technical security issues, their implications within WBD business units and able to effectively communicate them to management and other business leaders.
• Configure, troubleshoot, and maintain security infrastructure – including software and hardware in cloud environments, as well as on-premises.
• Conduct security audits and assessments to regularly determine the effectiveness of security platforms and identify areas of improvement.
• Host and operating systems hardening, auditing, monitoring and logging with appropriate security controls and best practices while meeting security best practices and business goals
• Research and explore emerging security technologies and determine their appropriate use within the company.
• Prepare, document, and create standard operating procedures and protocols.
• Crosstrain and mentor other team members as needed
Must Have Skills / Requirements
1) Implementing advanced cyber security technology in a complex environment
a. 5+ years of experience; Hands-on experience in security engineering, hands-on experience in building, designing, and maintaining enterprise security tools.
2) Scripting experience (using Python, Go, or other equivalent languages)
a. 5+ years of experience.
3) Hands-on Experience with automation technologies
a. 3+ Years of experience; Terraform, Ansible, CloudFormation, etc.
4) Linux Experience.
a. 5+ years of experience; Ability to construct and maintain complex network infrastructures.
Technology requirements:
• Engineer and administer security platforms including SIEM/SOAR systems, endpoint detection and response, vulnerability management, anomaly detection, and cloud analysis.
• Experience in managing the Brinqa vulnerability management platform and experience with Groovy programming language
• Must have 5+ years of scripting experience (using Python or other equivalent languages)
• Hands-on Experience in public cloud infrastructures like AWS (Amazon Web Services)
Nice to Have Skills / Preferred Requirements
1) Security and Cloud certifications are a plus. (CISSP, Splunk Admin, AWS Solution architect).
2) Media/entertainment or distributed global network experience.
Soft Skills
1) Hands-on technical experience with networking and computing system architectures, specifically, the security aspects thereof.
2) Thorough understanding of information security principles, techniques, principles, policy frameworks, and best practices
3) Hands-on technical experience with compliance and regulatory frameworks and how they affect architecture designs and review
Cloud Admin | Hybrid | Up to $150,000 based on experience | Ridgefield NJ
A growing technology organisation is seeking a Senior Engineer to manage and optimise cloud and hybrid infrastructure environments. This role spans both project delivery and operational support, with a strong focus on Linux-based systems alongside Windows, and ownership of cloud platforms and automation initiatives.
Responsibilities
- Manage and optimise cloud infrastructure across multi-platform environments
- Administer and support both Linux and Windows systems
- Perform system maintenance, patching, and upgrades
- Support and manage application deployments via CI/CD pipelines
- Work with containerised environments and orchestration tools
- Implement and maintain Infrastructure as Code solutions
- Contribute to automation and continuous improvement initiatives
- Participate in on-call rotation and support escalations
- Collaborate with internal engineering and development teams
Requirements
- Strong experience with public cloud platforms (AWS and/or Azure)
- Advanced Linux administration skills with some Windows exposure
- Experience with containerisation (Docker, Kubernetes or similar)
- Proficiency in scripting/automation (e.g. Bash, Python, PowerShell)
- Experience with Infrastructure as Code (e.g. Terraform or equivalent)
- Familiarity with CI/CD pipelines and DevOps practices
- Experience with configuration management tools (e.g. Ansible or similar)
Additional Information
- Hybrid working model (onsite presence required)
- Competitive compensation based on experience
- Opportunity to take on increased ownership/leadership over time
Interested? Apply now or send your updated resume to
Senior Dynatrace Engineer responsible for designing, implementing, and maintaining enterprise monitoring solutions. The role focuses on ensuring end-to-end observability across applications, infrastructure, and cloud environments using Dynatrace. The engineer will also provide expertise in performance monitoring, troubleshooting, and proactive incident management.
Key Responsibilities
Monitoring & Observability
· Configure, maintain, and optimize monitoring solutions using Dynatrace.
· Provide end-to-end visibility across infrastructure, applications, and services.
· Develop dashboards, alerts, and health checks to monitor system performance.
· Define monitoring thresholds to reduce false alerts and improve reliability.
Infrastructure Monitoring
· Monitor Windows and Linux servers, virtual environments (VMware), and cloud platforms (AWS, Azure, GCP).
· Monitor databases, middleware, and network infrastructure components.
· Identify system trends and capacity requirements with infrastructure teams.
· Proactively detect and resolve system performance issues.
Application Performance Monitoring
· Monitor application performance and transaction flows using Dynatrace APM.
· Implement synthetic monitoring and real user monitoring.
· Collaborate with development teams to ensure comprehensive monitoring coverage.
· Troubleshoot performance issues across applications and systems.
Incident Management
· Support incident response activities related to performance and availability issues.
· Provide monitoring insights during root cause analysis.
· Identify monitoring gaps and improve monitoring coverage.
Continuous Improvement
· Improve monitoring standards, documentation, and best practices.
· Recommend enhancements to monitoring configurations and alerting strategies.
· Integrate monitoring tools with ITSM platforms such as Jira or ServiceNow.
Required Qualifications
Education
· Bachelor’s degree in Computer Science, Information Systems, or equivalent experience.
Experience
· Minimum 5+ years of experience in systems engineering, infrastructure support, or monitoring roles.
· 5+ years hands-on experience with Dynatrace or similar APM tools.
· Experience migrating Dynatrace Managed to Dynatrace SaaS environments.
Technical Skills
· Strong knowledge of Windows/Linux servers and VMware environments.
· Experience with cloud platforms (AWS, Azure, GCP).
· Understanding of networking concepts such as DNS, TCP/IP, and load balancing.
· Experience with automation or scripting (PowerShell, Bash).
· Knowledge of monitoring baselines, KPIs, and SLAs.
· Familiarity with enterprise log analysis frameworks and tools like Jira or ServiceNow.
· Experience monitoring containerized environments such as Kubernetes (preferred).
Preferred
· Exposure to AWS Solution Architect Professional Certification.
Soft Skills
· Strong troubleshooting and analytical skills.
· Excellent written and verbal communication.
· Ability to collaborate across technical and business teams.
· Detail-oriented with a proactive approach to system monitoring and reliability.