Codex Github Jobs in Usa
217 positions found — Page 6
Job Title: Rancher & Kubernetes SME
Location: Princeton, NJ - 08540
Mode: Contract Role – Onsite
only W2
Minimum 15+ years of experience required.
Qualifications:
- Design and implement Rancher-managed Kubernetes clusters (RKE, RKE2, K3s, EKS, AKS, GKE).
- Architect high availability (HA) Rancher setups.
- Define multi-cluster and multi-tenant strategies using Rancher projects, namespaces, and RBAC.
- Integrate Kubernetes with VMware, Bare Metal, and Cloud platforms.
- Establish standardized cluster blueprints and reference architectures.
- Act as final escalation (L3) for Kubernetes and Rancher incidents.
- Diagnose and resolve Control plane failures
- etcd performance and corruption issues
- Pod scheduling and node pressure issues
- CNI (Calico / Cilium) networking problems
- CSI storage failures (Ceph, Longhorn, EBS, Azure Disk, NFS)
- Perform root cause analysis (RCA) and provide preventive recommendations.
- Install, upgrade, and maintain Rancher Server.
- Manage cluster lifecycles using Rancher UI & APIs.
- Implement and manage Rancher RBAC, Authentication (AD / LDAP / Azure AD / SSO)
- Global & cluster-level policies
- Maintain Rancher backups, DR, and recovery procedures
- Enforce Kubernetes security best practices like Pod Security Standards (PSS)
- Network policies and Secrets management
- integrate Kubernetes with CI/CD tools e.g., GitHub Actions, GitLab CI, Jenkins, Argo CD
- Enable GitOps workflows for application and cluster configuration.
- Support Helm chart development and lifecycle management.
- Assist development teams with Deployment strategies, Resource optimization
- Troubleshooting application issues on Kubernetes
Experience:
- 6–10+ years in Linux / Infrastructure / Cloud
- 3–5+ years hands-on Kubernetes production experience
- Strong expertise in Rancher (RKE / RKE2 / K3s)
- Deep understanding of:
- Kubernetes control plane
- etcd
- Networking (CNI)
- Storage (CSI)
Thanks,
Rahul Gupta
Direct: (732) 743-7543
Position Summary
Our client is building a modern, cloud-native platform that powers connected, data-driven manufacturing operations. Their technology sits at the center of increasingly automated factories, integrating equipment, software systems, and real-time production data into a scalable SaaS platform used by global manufacturers.
To support rapid growth and platform scale, they are seeking a Senior Cloud Operations Engineer to own the reliability, performance, and operational excellence of their cloud infrastructure. This is a highly impactful role responsible for ensuring the platform remains highly available, secure, and scalable as adoption continues to grow.
This position is ideal for engineers who thrive in modern cloud environments, enjoy solving complex reliability challenges, and prefer automating everything possible. The right person will combine deep technical expertise with strong operational discipline, helping build a world-class cloud platform supporting real industrial environments.
Key Responsibilities
Cloud Operations & Reliability
• Maintain and optimize production, staging, and development environments running in Kubernetes on AWS
• Implement and manage monitoring, logging, alerting, and observability frameworks
• Lead incident response efforts and drive post-incident reviews focused on continuous improvement
• Own backup, disaster recovery, and business continuity processes
• Perform system capacity planning and performance tuning
Automation & Infrastructure Management
• Build and maintain Infrastructure-as-Code using tools such as Terraform or Pulumi
• Automate provisioning, configuration management, and environment lifecycle processes
• Identify and eliminate operational inefficiencies through automation
• Manage secrets, environment configuration, and version control across infrastructure environments
Security & Compliance
• Implement and maintain least-privilege access models and cloud security guardrails
• Support vulnerability management, patching workflows, and dependency maintenance
• Assist with compliance readiness efforts including SOC 2, ISO 27001, or similar frameworks
• Ensure proper logging, retention, and audit practices across cloud environments
FinOps / Cost Optimization
• Monitor and optimize cloud spend across services and environments
• Implement tagging standards, budget alerts, and cost visibility frameworks
• Recommend architectural improvements to balance performance and cost efficiency
Collaboration & Leadership
• Partner closely with engineering teams to improve reliability, deployment pipelines, and system architecture
• Mentor engineers on operational best practices and cloud platform management
• Develop runbooks, documentation, and operational standards
• Champion reliability engineering principles, operational maturity, and risk reduction practices
Technical Environment
Candidates should be comfortable working in modern cloud-native environments and familiar with:
• Kubernetes clusters, autoscaling, Helm charts, and service mesh concepts
• AWS cloud services including compute, networking, storage, and cost management
• Infrastructure-as-Code frameworks such as Terraform
• Observability platforms such as Datadog, CloudWatch, Prometheus, or New Relic
• CI/CD tools such as GitHub Actions, Bitbucket Pipelines, or Bamboo
• Linux systems administration and troubleshooting
• SRE practices including SLIs, SLOs, MTTR, RTO/RPO, and incident management
Role description
At Tata Technologies we make product development dreams a reality by designing, engineering, and validating the products of tomorrow for the world’s leading manufacturers. Due to our continued growth, we are now recruiting for a below position
Role Overview
We are looking for an experienced xIL Onsite Coordinator to lead and coordinate development and integration activities related to xIL platforms, test automation, CI/CD pipelines, and virtualization environments for automotive software validation.
This role will act as the technical interface between the customer and offshore engineering teams, ensuring smooth execution of HIL/SIL automation, DevOps integration, and platform development.
Key Responsibilities
- Coordinate development and integration of Python-based xIL automation libraries and frameworks.
- Support implementation of test automation frameworks (Robot Framework or similar) for automotive testing.
- Manage CI/CD pipelines for automated test execution, build, deployment, and reporting.
- Coordinate integration of HIL/SIL platforms and automotive test environments.
- Develop and maintain automation scripts and workflows using Python and YAML.
- Support development of REST APIs and backend services for platform integration.
- Work with automotive communication protocolssuch as CAN, UDS, and XCP.
- Act as the onsite technical coordinator between customer teams and offshore engineering teams.
- Troubleshoot platform, automation, and integration issues to ensure smooth project execution.
Required Skills
- Strong Python programming and automation development experience.
- Hands-on experience with CI/CD pipelines (GitHub Actions or similar tools).
- Experience with Robot Framework or other automation frameworks.
- Knowledge of HIL/SIL testing platforms and dSPACE toolchain.
- Experience with REST APIs and backend integrations.
- Knowledge of automotive communication protocols (CAN, UDS, XCP).
- Experience with Git version control.
- Strong communication and stakeholder coordination skills.
Good to Have
- Knowledge of ASAM standards and xIL APIs.
- Experience with automotive calibration tools (INCA, CANape).
- Exposure to cloud platforms (AWS/Azure/GCP).
- Experience with Docker/Kubernetes.
- Experience working in Agile/Scrum environments.
Equal Opportunity Statement:
Tata Technologies Inc. is an Equal Opportunity/ Affirmative Action employer. We provide equal employment opportunities to all qualified employees and applicants for employment without regard to race, religion, sex, age, marital status, national origin, sexual orientation, citizenship status, veteran status, disability, or any other legally protected status. We prohibit discrimination in decisions concerning recruitment, hiring, compensation, benefits, training, termination, promotions, or any other condition of employment or career development.
Tata Technologies: Engineering a better world.
Tata Technologies would like to thank all applicants for their interest, each application will be reviewed against the set criteria for the role. We would like to advise that only candidates under consideration will be contacted. If you do not hear from us within 10 working days following the closing date it will mean that unfortunately your application has not been successful. We will however retain your details for any suitable future opportunities.
- Includes Static Application Security Testing (SAST), Dynamic Application Security Testing (DAST), API security testing, AI/ML platforms, and penetration testing
- Ensuring compliance with industry standards such as OWASP Top10, CWE, CVE, and NIST guidelines
Required Technical Knowledge& Competencies
- Expertise in SAST, DAST, API security testing, and penetration testing.
- Strong programming knowledge (Java, .NET, Python, JavaScript) for code level analysis,
- Background of Development
- Build, maintain, and secure automation pipelines using tools like Jenkins, GitLab CI, or GitHub Actions, ensuring security scans occur at every code commit.
- Implement and manage security tools, including Static Application Security Testing (SAST), Dynamic Application Security Testing (DAST), Container Security (e.g., Trivy), and dependency scanning
- Use tools like Terraform or Ansible to deploy secure, compliant infrastructure.
- Proactively identify, prioritize, and remediate security vulnerabilities in application code and infrastructure.
- Ensure compliance with industry standards (e.g., PCI-DSS, GDPR) by embedding compliance-as-code into the development workflow.
- Act as a security advocate, working with DevOps and Development teams to foster a \"security first\" culture. Familiarity with cloud security testing (AWS, Azure, GCP),
- Experience with container security (Docker, Kubernetes),
- Excellent communication and stakeholder management skills.
Qualifications
- Bachelor’s degree in computer science, Information Security, or related field,
- 6-8 years of IT experience, with at least 5+ years in application security testing.
- Preferred certifications: OSCP, CEH, GWAPT, CISSP
DevOps Architect
Los Angeles, CA - Onsite (Day 1)
Long Term Contract
Skills Required:
- AWS & GCP
- Docker & Kubernetes
- Pulumi
Job Description
We are seeking a highly skilled Senior DevOps Architect with deep expertise across multi‑cloud environments and modern DevOps tooling. The ideal candidate is an SME with strong hands‑on experience building, automating, deploying, and optimizing infrastructure at scale.
Key Responsibilities
- Serve as a DevOps SME with 8+ years of multi‑cloud experience, including AWS, GCP, and hypervisor frameworks.
- Expertise in managed cloud services such as Lambda, Cloud Functions, S3 (large volumes), Elasticsearch, Step Functions, DynamoDB, Aurora, and other RDS services.
- Strong background in Docker-based container platforms and CI/CD workflows.
- Advanced scripting and automation capabilities with Terraform, IaC, and Pulumi.
- Ability to write reusable modules and infrastructure code (Python preferred).
- Strong SQL skills and understanding of relational and non-relational databases; proficiency in database tuning.
- Experience working with multiple build systems: npm, Maven, Poetry, Mono, ReactJS, VueKit.
- Proficient in all aspects of Kubernetes, including deployment automation using Helm and Kustomize.
- Ability to understand APIs, create reusable CI/CD modules, and document work in GitHub.
- Experience leading offshore DevOps/System Engineers and enforcing IaC adoption.
- Collaborate with AWS, GCP, Apple Cloud/Hybrid Cloud teams to troubleshoot 3PC issues.
- Skilled in observability tooling, incident management, and performance optimization.
- Strong knowledge of networking (DNS, load balancers, VPNs, VPCs, firewalls, access control).
- Experience building modules using Terraform, AWS CDK, or Pulumi.
- Knowledge of Java is a plus.
Must Have Qualifications
- Proven leadership and mentoring experience.
- Deep understanding of security best practices, vulnerability mitigation, and risk management.
- Performance tuning and optimization expertise.
- Experience with disaster recovery and backup strategies.
- Strong experience in hybrid cloud environments.
Looking for a 10+ years of experience in full stack software development, including front-end, back-end, and database technologies in Bothell, WA and its an onsite role.
- Bachelor’s degree in computer science, software engineering, or a related field preferred
- Proficiency in modern programming languages and frameworks such as Python, JavaScript, Java, Next JS, Node.js, React js
- Strong working experience with GenAI, LLM Models, MCP, Vector DB, RAG, Vertex AI, Agentic AI frameworks like NGA, ADK or LangChain/LangGraph, creating AI agents.
- Strong experience with Cloud platforms like GCP, Azure or AWS and cloud technologies like OpenStack, Terraform, Ansible or Chef
- Experience working with LLM observability, analytics, evaluations, testing and annotation using tools like LangSmith, LangFuse, Streamlit, Arize or similar tools.
- Strong experience working with AI/ML development
- Strong experience working with Databases like Cassandra, MongoDB or similar.
- Strong understanding and working experience of microservices architecture, RESTful APIs, Caching and related technologies
- Familiarity with containerization and orchestration tools such as Docker and Kubernetes
- Proficiency in version control systems like Git, and experience with CI/CD tools such as Jenkins, GitHub, Maven, Nexus, JFrog or Sonar
- Strong experience in Unit and Function testing using Junit, Mockito/JMock, Selenium, Robot, Cucumber, SoapUI or Postman
- Strong problem-solving, analytical, and debugging skills.
- Excellent written and verbal communication skills, with the ability to effectively communicate complex technical concepts to both technical and non-technical audiences.
- Demonstrated experience in mentoring and providing technical leadership to other engineers.
- Nice to have skills:
- Google CCAI platform (DialogFlow), Vertex AI, Graph QL, BigQuery, Conversation Graph, LLM as Judge
Must-Haves:
- Bachelor's degree
- 5 Years+ Development experience, at least 3 of which should be in a non-junior role
- Ability to at minimum read/understand Ansible/Python code (preferrable if they can develop in those languages)
- Experience designing and implementing development solutions for complex problems that comprised at least 2 months of work.
- Experience running projects that span multiple teams or disparate segments
- Ability to provide a GitHub link to a repository of development examples (personal projects are acceptable)
- Ability to communicate complex technical details in a direct, concise and understandable way.
- History in working at least one effort which required code to direct connect to servers, virtual machines or network devices.
- Experience in developing code utilizing API calls
- At least one job where troubleshooting or code testing was a demonstrated component of the work.
- At least one job where mentoring/training was a demonstrated component of the work.
Plus:
- Experience working with Ansible and Python, or (less preferred) Powershell
- AWX, AAP and/or Puppet Enterprise experience
- Previous work in IT within a regulated industry (banking or government)
- ServiceNow experience
- SysAdmin, Platform management or Infrastructure Engineering Experience
- Professional Presentation experience beyond immediate management
- Experience in designing a solution to mitigate a risk concern
Principal System Embedded Engineer (SONiC)
Duration – 6 months to Hire
Location – San Jose CA – Hybrid / 2 days
Notes - Manager is looking for someone with SONiC community level experience & GitHub link .
Please provide a note up of 7-8 lines depending on the Community level experience when comes to SONiC
Responsibilities:
- Design, develop, and maintain features and enhancements for the SONiC NOS platform, interfacing with SAI and platform infrastructure.
- Contribute to the SONiC open-source community and stay current with the evolving SONiC ecosystem.
- Develop forwarding features on SONiC and the underlying hardware (e.g., ASICs, PHYs, optics, and other platform components).
- Implement code for critical system modules, drivers, and APIs supporting high-performance data planes and control planes.
- Debug, troubleshoot, and resolve issues on SONiC platforms.
- Participate in code reviews, and documentation efforts.
- Contribute to architecture discussions to ensure scalable and highly available SONiC integrations.
- Contribute to SONiC SAI features and platform-specific management/control modules (e.g., telemetry, diagnostics, and monitoring components).
Basic Qualifications:
- Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related field.
- Minimum of 10 years of work experience is required.
- With at least 1 year of hands-on SONiC development experience is must.
- Strong experience with the SONiC network operating system and architecture.
- Demonstrated feature contributions to the SONiC open-source community.
- Experience using SONiC SAI for new feature development and integration.
- Experience with datapath forwarding features such as BFD, FIB, RIB, ERSPAN, ACLs, QoS, unicast, and multicast.
- L2/L3 Protocol Stack Development
- L3: BGP, OSPF, IS-IS, EVPN/VXLAN, MPLS, etc…
- L2: STP, LLDP, LACP, etc…
- Experience with FRR open routing stack
- Experience with Redis DB, Docker
- Experience in Data Plane/Embedded software development/kernel drivers.
- Proficient in Python, C/C++.
- Familiarity with Linux internals and containerized environment.
- Excellent problem-solving skills and ability to work in a fast-paced, collaborative environment.
- Knowledge of network ASICs (e.g., Broadcom, Marvell) and switch hardware architecture.
Additional Skills:
Cloud Architectures, Cross Domain Knowledge, Design Thinking, Development Fundamentals, DevOps, Distributed Computing, Microservices Fluency, Full Stack Development, Security-First Mindset, Solutions Design, Testing & Automation, User Experience (UX)
Crown Equipment Corporation is a leading innovator in world-class forklift and material handling equipment and technology. As one of the world’s largest lift truck manufacturers, we are committed to providing the customer with the safest, most efficient and ergonomic lift truck possible to lower their total cost of ownership.
Primary Responsibilities
- Design, develop, and maintain automated test scripts using Selenium and Intellij IDE.
- Create reusable and clean code that supports robust testing frameworks. Integrate automated tests into CI/CD pipelines to enable continuous testing.
- Conduct comprehensive API testing ensuring thorough end-to-end integration across various systems.
- Perform database testing and writing SQL queries across multiple database management platforms.
- Analyze requirement specifications focused on determining the viability and feasibility of automation for eligible features.
- Debug and resolve automation failures.
- Maintain the automation repository.
- Manage the execution of automation regression suites.
- Perform functional and non-functional testing of software products and solutions developed.
- Perform regression testing of module firmware as needed.
- Write, revise, and verify quality standards and test procedures for program design and product evaluation to attain quality of software.
- Develop processes and procedures to test product requirements, use cases, and wireframes in the form of test cases and other documentation.
- Perform requirement analysis and test estimation of software under test.
- Design test cases according to the quality standards and procedures.
- Define test data and test environment requirements to execute defined tests.
- Perform defect reporting, management and closure as per department standards and procedures.
- Participate proactively in QA initiatives for continuous improvements according to department objectives.
Minimum Qualifications
- Bachelor’s degree (Computer Science, Information Systems) and at least 2 years related experience. Non-degree considered if 12+ years of related experience along with a high school diploma or GED
- Able to automate test scripts using Selenium and Intellij IDE.
- Proficient in at least one mainstream programming language such as Java, Python, C#, JavaScript/TypeScript.
- Experience with code versioning and CI/CD tools like GitHub, Jenkins, Bamboo or similar tools to integrate and run the automated tests in pipelines and enable continuous testing.
- Experience with API testing tools like POSTMAN, SOAP UI, RestAssured.
- Experience in writing SQL queries and database testing using MySQL, SQL Server, Oracle, or PostgreSQL.
- Experience in quality assurance methodologies or software testing
- Good written, verbal, analytical and interpersonal skills.
Work Authorization:
Crown will only employ those who are legally authorized to work in the United States. This is not a position for which sponsorship will be provided. Individuals with temporary visas or who need sponsorship for work authorization now or in the future, are not eligible for hire.
No agency calls please.
Compensation and Benefits:
Crown offers an excellent wage and benefits package for full-time employees including Health/Dental/Vision/Prescription Drug Plan, Flexible Benefits Plan, 401K Retirement Savings Plan, Life and Disability Benefits, Paid Parental Leave, Paid Holidays, Paid Vacation, Tuition Reimbursement, and much more.
EOE Veterans/Disabilities
Remote working/work at home options are available for this role.
Position: Infra Maintenance- VMWare Consultant
Location: San Jose, CA (Hybrid)
Employment: Contract
Required Skills:
- Strong hands-on experience in deployment and troubleshooting of VMware vSphere (must-have).
- Practical experience managing ESXi hosts, clusters, storage, and virtual networking.
- Working knowledge of Kubernetes concepts (nodes, pods, services, namespaces, ingress, storage, RBAC).
- Experience installing and operating Kubernetes clusters on bare metal or virtualized environments.
- Experience with Canonical MAAS for bare-metal provisioning, commissioning, and deployment workflows.
- Hands-on experience with Cisco switch configuration (CLI-based).
- Solid understanding of L2/L3 networking concepts:
- VLANs, trunking, STP
- Static routing, basic dynamic routing concepts
- Subnetting, routing tables, ACLs, NAT basics
- Strong experience with Linux administration (Ubuntu/CentOS/RHEL or similar).
- Hands-on experience with Terraform and/or Ansible
- Familiarity with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI, GitHub Actions, Azure DevOps) for automated infra and application deployments.
- Understanding of version control (Git) and standard branching/PR workflows.