Terraform GitHub Repository Jobs in the USA
Company Information
Kretek International is America’s #1 importer, marketer, and distributor of specialty tobacco products, including high-end cigars, tobacco, and alternative products, with such well-known brands as Djarum, Cuban Rounds, and High Tea Herbal Wraps. Founded more than 38 years ago, Kretek continues to grow and consistently seeks talented employees who can help drive that growth. For more information, please visit:
About the Role
We are seeking an experienced Infrastructure Network & Security Engineer to design, implement, and secure our customers' enterprise network infrastructures. This mid-level hybrid engineering position requires 5-7 years of hands-on experience spanning network architecture, implementation, and security operations. The ideal candidate will be a proactive problem-solver who can balance network infrastructure engineering with security monitoring and threat response, working both independently and collaboratively to deliver robust, scalable, and secure network solutions.
Key Responsibilities
Network Design & Architecture
· Design and architect enterprise LAN/WAN network solutions that meet business and security requirements
· Lead network infrastructure projects including upgrades, migrations, and new deployments
· Configure and deploy Cisco routing/switching platforms, firewalls (SonicWall, Cisco, Fortinet), and enterprise wireless solutions
· Monitor and respond to security alerts from Intune, Microsoft Defender, and other security platforms
· Utilize Auvik for network monitoring, mapping, performance analysis, and configuration management
· Implement and manage network security controls including firewalls, VPNs, IDS/IPS, and network access control (NAC)
· Troubleshoot complex network and security issues and perform root cause analysis
· Create and maintain network documentation including topology diagrams, security policies, and implementation plans
· Implement routing protocols (OSPF, BGP), network segmentation, VLANs, and ACLs
· Configure endpoint security policies through Intune and integrate with conditional access
· Mentor junior team members and collaborate with cross-functional IT teams
Required Qualifications
Experience & Education
· 5-7 years of progressive experience in network engineering and infrastructure design
· Bachelor's degree in IT, Computer Science, or related field preferred
Technical Skills - Networking
· Expert knowledge of Cisco platforms (Nexus, Catalyst, ISR/ASR routers)
· Proficiency with SonicWall, Cisco Firewalls, or Fortinet
· Strong understanding of routing protocols (OSPF, BGP), TCP/IP, VLANs, ACLs
· Experience with enterprise wireless (Cisco, Aruba, or Ruckus)
· VPN technologies and network access control (NAC) experience
· Experience with Auvik or similar network monitoring platforms
Technical Skills - Security
· Experience with Microsoft Intune and Defender for Endpoint
· Security alert monitoring and incident response
· Knowledge of security frameworks (NIST, CIS Controls, Zero Trust)
· Familiarity with IDS/IPS, SIEM, and vulnerability scanners
· Understanding of conditional access and identity-based security
Professional Skills
· Excellent troubleshooting and problem-solving abilities
· Strong communication skills for technical and non-technical audiences
· Project management experience with infrastructure initiatives
· Ability to manage multiple priorities in a fast-paced environment
Preferred Qualifications
· CCIE or expert-level networking certifications
· Experience with Cisco ISE deployment
· Network automation tools (Ansible, Python, Terraform)
· Cloud networking experience (AWS, Azure, GCP)
· SIEM platforms (Splunk, Sentinel)
· MSP environment experience
Physical Requirements:
· Ability to stand and walk.
· Ability to reach above, at, or below waist height
· Ability to kneel, bend, stoop, turn and twist
· Ability to lift 25 lbs regularly and up to 50 lbs occasionally
Safety:
· The incumbent must be able to perform this job safely without endangering the health or safety of self or others.
Supervisory Responsibility:
· This position currently has no supervisory responsibility for other staff.
Please note that this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities and activities may change or be added at any time per business needs.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, or protected veteran status and will not be discriminated against on the basis of disability.
We are seeking a highly skilled Senior Splunk Enterprise Security Engineer to join our dynamic Security Engineering & Architecture team in Irving, TX. This is an exciting opportunity to lead the deployment, optimization, and administration of our Splunk Enterprise Security (ES) platform within a cloud-based environment. The ideal candidate will have extensive hands-on experience with Splunk ES, cloud security platforms, and a deep understanding of SIEM operations at an enterprise scale. This role offers a chance to work in a complex, high-volume environment and make a significant impact on the organization’s security infrastructure.
Job Title: Senior Splunk Security Engineer
Job Location: Irving, TX 75063
Job Duration: 12 Months with possible extension
Comments: The candidate must have hands-on Splunk expertise, with a minimum of 5 years specific to Splunk Enterprise Security.
Additional Skills: Splunk Enterprise Security Administrator, Splunk Cloud Administrator.
Job Description:
We are seeking a Senior Splunk Enterprise Security Engineer to join our Security Engineering & Architecture team in Irving, TX. In this high-impact individual contributor role, you will own the deployment, optimization, and day-to-day administration of our Splunk Enterprise Security (ES) platform across a cloud-based environment supporting one of the largest retail operations in the country.
You will be the go-to subject matter expert for Splunk ES, partnering with SOC analysts, threat intelligence teams, compliance stakeholders, and IT leadership to ensure our security monitoring platform delivers maximum visibility, reliability, and value. This is a hands-on, technically deep role for someone who thrives in complex, high-volume environments and takes pride in building resilient security infrastructure.
Responsibilities:
- Lead the end-to-end administration of Splunk Enterprise Security across a cloud-hosted (AWS/Azure/GCP) deployment, including architecture decisions, capacity planning, performance tuning, and version upgrades.
- Design, implement, and maintain ES frameworks including notable event configurations, risk-based alerting, asset and identity correlation, and threat intelligence integrations.
- Develop and optimize correlation searches, dashboards, and investigation workflows to reduce alert fatigue and accelerate analyst response times.
- Drive data source onboarding and ensure CIM (Common Information Model) compliance for new and existing log sources across the enterprise.
- Partner with compliance teams to ensure Splunk ES configurations directly support PCI DSS, SOX, and NIST CSF audit and reporting requirements.
- Establish and maintain health monitoring for the Splunk environment, including search performance, indexing throughput, forwarder connectivity, and license utilization.
- Create and maintain operational documentation, runbooks, and knowledge base articles for Splunk ES administration and troubleshooting.
- Serve as the escalation point for complex Splunk issues and participate in incident response efforts during critical security events as needed.
- Evaluate and recommend new Splunk apps, add-ons, and integrations that strengthen the organization’s security posture.
- Collaborate with Security Architecture peers to align Splunk ES capabilities with the broader security tooling ecosystem and long-term technology roadmap.
Required Skills:
- 5+ years of hands-on experience with Splunk platform administration, with significant depth in Splunk Enterprise Security.
- Active Splunk certifications required: Splunk Enterprise Certified Admin and/or Splunk ES Certified Admin.
- Proven experience managing Splunk deployments in cloud environments (AWS, Azure, or GCP).
- Deep understanding of security monitoring, log management, SIEM operations, and event correlation at enterprise scale.
- Working knowledge of PCI DSS, SOX, and NIST CSF compliance frameworks and how they translate into SIEM use cases and reporting requirements.
- Strong SPL (Search Processing Language) proficiency, including complex statistical commands, lookups, macros, and data models.
- Experience with Splunk infrastructure components: indexers, search heads, heavy/universal forwarders, deployment servers, and cluster management.
- Excellent communication skills with the ability to translate complex technical concepts for non-technical stakeholders.
Desired Skills:
- Experience in large-scale retail or similarly complex, high-transaction-volume environments.
- Familiarity with Splunk SOAR (formerly Phantom) and security automation/orchestration workflows.
- Background in detection engineering, threat hunting, or SOC operations.
- Additional certifications such as CISSP, GIAC (GCIA, GCIH), or cloud security credentials (AWS Security Specialty, AZ-500).
- Experience with Infrastructure as Code (Terraform, Ansible) for Splunk deployment management.
- Scripting proficiency in Python, Bash, or PowerShell for automation and custom integrations.
Title: Senior Splunk Enterprise Security Engineer
Duration: Long term
Location: Irving, TX
(ONLY W2)
Job Description:
Key Responsibilities
- Lead end-to-end administration of Splunk Enterprise Security in AWS/Azure/GCP
- Perform capacity planning, performance tuning, and platform upgrades
- Manage indexers, search heads, forwarders, deployment servers, and clustering
- Develop and optimize correlation searches, notable events, dashboards, and workflows
- Implement risk-based alerting, asset & identity correlation, and threat intelligence integrations
- Onboard new log sources and ensure CIM compliance
- Monitor platform health (search performance, indexing, license usage, forwarder connectivity)
- Support PCI DSS, SOX, and NIST CSF reporting and audit requirements
- Create runbooks, SOPs, and operational documentation
- Act as escalation point for complex Splunk issues and support incident response
- Evaluate Splunk apps, add-ons, and SOAR integrations
- Automate deployments using Terraform, Ansible, Python, Bash, or PowerShell
Required Skills & Experience
- 5+ years of hands-on Splunk administration with strong Splunk ES experience
- Active Splunk Enterprise Certified Admin and/or Splunk ES Certified Admin
- Experience managing Splunk in cloud environments (AWS, Azure, or GCP)
- Strong SPL (stats, lookups, macros, data models)
- Deep knowledge of SIEM operations, log management, and event correlation
- Experience with Splunk infrastructure components (indexers, search heads, forwarders, clustering)
- Knowledge of PCI DSS, SOX, and NIST CSF frameworks
- Strong communication and stakeholder collaboration skills
Preferred Qualifications
- Experience in large-scale retail or high-transaction environments
- Familiarity with Splunk SOAR (Phantom)
- Background in SOC operations, detection engineering, or threat hunting
- Certifications: CISSP, GCIA, GCIH, AWS Security Specialty, AZ-500
- Experience with Infrastructure as Code for Splunk deployments
We are seeking a senior to advanced level software engineer with strong expertise in front-end development. While this role does include full-stack development, the initial project will be primarily focused on frontend delivery. In addition to application delivery, this role mentors less experienced development staff and collaborates closely with our User Experience team.
Key Activities
- Collaborates with UX and graphic designers to deliver visually appealing web solutions adhering to 508 compliance standards and standardized design systems.
- Partners with product owners and customers in the development of innovative solutions that achieve business goals.
- Reviews and analyzes business and technical requirements and implements technical solutions to meet those requirements.
- Works in a multidisciplinary team with full-stack developers.
- Applies the principles of software engineering to the design, implementation, configuration, and optimization of multiple web-based applications.
- Creates unit and automation tests as part of Continuous Development.
- Performs cross-browser testing of new features.
- Conducts peer code reviews, provides recommendations, and works with peers to improve software coding practices.
- Fixes bugs, supports QA and UAT phases of releases.
- Keeps abreast of latest and emerging technologies.
- Fosters an agile mindset enabling high-performing teams.
- Provides coaching and education, and advocates for frontend development best practices.
- Experience with API infrastructure and development, and associated tools and best practices.
- Provides on-call support, troubleshooting, root cause analysis, incident management, and service request management for supported products and environments.
Required Qualifications
- Typically requires 6 – 10 years of relevant experience.
- Bachelor's degree specializing in STEM (Science, Technology, Engineering, Mathematics), or a closely related field, from an accredited college or university, or equivalent combination of directly related education and/or experience.
- Senior to advanced understanding of subject. Has in-depth and/or breadth of knowledge in discipline.
- Proficiency with Java, TypeScript, CSS, HTML methods.
- Senior to Advanced experience with Angular.
- Performs work independently with limited supervision and direction. Serves as a mentor for less experienced staff.
- Works efficiently under tight deadlines and adapts quickly to change.
- Exceptional attention to detail and pride in delivering consistently pixel-perfect work.
- Creation of modern CI/CD pipelines using DevOps tooling (e.g. Jenkins, Git, Bitbucket, GitLab, Fortify, Sonar, etc.).
- Knowledge of AWS services and security best practices.
- Cloud networking across numerous accounts, environments, and vendors, and zero trust principles.
- Terraform to deploy AWS cloud services and infrastructure.
Preferred Qualifications
- Strong expertise in the creation and/or practical application of components in design systems (versus only having exposure to pattern libraries).
- Advanced experience with multiple programming languages (Java, Python, etc.).
- Advanced knowledge of some cloud-based platforms like AWS, Azure, or Google Cloud, etc. and the ability to learn new platforms.
- Willingness to become proficient in any new programming language or tool quickly.
- Experience with centralized application observability and monitoring across disparate tools and services.
We are seeking a UNIX Administrator with excellent technical, process, and automation skills to be part of a High-Performance Cloud Operations Team. As an Infrastructure Administrator, this person is responsible for the daily administration of Linux and Unix servers in a business application environment. This includes general system administration tasks, software and hardware support, system configuration, and system monitoring. This person must have excellent Linux/Unix administration experience along with customer relations skills. The candidate should be able to work with business application administrators, helping troubleshoot their applications and guiding them with standard methodologies. The candidate must be able to express thoughts clearly and be capable of working in a team or as a sole contributor. The individual should be self-motivated with very good communication skills, and will be the main point of responsibility for the overall operability, resiliency, performance, and capacity of owned production services.
What you'll do
- System Administration - This person would be responsible for the day-to-day administration of all Linux-based servers. This includes monitoring the trouble ticket queue, system troubleshooting, hardware and software system changes, scripting, patching, system performance monitoring, system sizing, system integration, upgrade implementation, and hardware diagnostics.
- Application support – This person would work with application administrators to help fix and fine-tune applications and, if required, guide them in standard processes related to using the underlying UNIX infrastructure.
- Documentation – Maintain all system documentation.
What you need to succeed
- Unix/Linux System Administration: In-depth experience with Unix/Linux servers (especially SUSE, AIX, RHEL, CentOS) for installation, configuration, patching, and troubleshooting.
- Automation & Scripting: Proficiency in scripting (Bash, Python) and automation tools (Ansible, etc.) to streamline deployments and manage configurations.
- Demonstrable ability to perform UNIX builds.
- Understanding of Red Hat Satellite, IBM NIM, or SUSE Manager for patch management.
- Networking Knowledge: Strong grasp of networking (TCP/IP, DNS, SSH, etc.) and system connectivity for effective troubleshooting in distributed environments.
- Working knowledge of virtual machine management (VMware, OpenShift), TCP/IP functionality, networking, remote administration, cloning, migration, etc.
- Security Best Practices: Expertise in system security – user access controls, OS hardening, patch management, and compliance.
- Soft Skills: Strong communication, teamwork, and problem-solving skills to collaborate across teams and resolve complex issues efficiently.
- Operational experience with Ansible and Terraform is beneficial.
Senior Technical Support Engineer
Location: San Francisco, CA | Raleigh, NC | Dallas, TX | Boston, MA
Schedule: Hybrid – 3 days onsite required
Employment Type: 6-Month Contract-to-Hire
Pay Rate: $65–68/hour
Start Date: ASAP
About the Role
The Technical Solutions team is focused on advancing care and research innovation. We support new business initiatives by expanding product capabilities in strategic areas and delivering a scalable technical support framework across multiple product portfolios.
As a Senior Technical Support Engineer, you will partner closely with internal stakeholders to identify, reproduce, troubleshoot, and resolve complex technical issues. You will support infrastructure, permissions, and configuration changes while delivering high-level technical support and sustaining engineering services that help customers achieve meaningful business outcomes.
This role offers the opportunity to collaborate with customers, developers, architects, and operations teams to solve challenging, high-impact problems. You will also contribute to building support tooling and infrastructure to improve operational efficiency.
Travel up to 10% may be required.
Key Responsibilities
- Own and manage technical customer issues from identification through full resolution
- Reproduce and troubleshoot complex technical problems, including reviewing and analyzing code to determine root cause
- Project manage new client deployment issues through to completion
- Implement infrastructure, security, and permissions configuration changes
- Drive operational efficiencies by identifying improvements in process, tooling, and product functionality
- Develop playbooks and knowledge base documentation to streamline issue resolution
- Create internal reports and dashboards for issue tracking and performance monitoring
Minimum Qualifications
- Bachelor’s degree in Computer Science, Information Systems, Mathematics, Statistics, or related field
- Cloud operations experience (creating buckets, virtual machines, and managing security access controls/IAM)
- 3+ years of experience with Python or another object-oriented programming language
- 3+ years of experience working with SQL
- Experience troubleshooting data-related issues
- Proficiency with GitHub and Jira
- Strong troubleshooting skills with the ability to track complex technical details
- Excellent communication skills with the ability to translate technical findings for both senior developers and non-technical stakeholders
Preferred Qualifications
- 4+ years of experience in healthcare technology
- Experience supporting highly regulated software environments
- Experience with R
- Infrastructure-as-Code (IaC) experience such as Terraform, Ansible, or similar tools
- Self-starter mindset with strong ownership and a passion for driving issues through to resolution
Technical Lead / Platform Engineer
GenAI Healthcare SaaS Platform
Palo Alto, CA (Hybrid – 3 days onsite)
About the Role
We’re building a next-generation generative AI platform purpose-built for healthcare — and we’re looking for a hands-on Technical Lead to help architect and scale it from the ground up.
This is not a “manage from a distance” role. You’ll be deeply involved in system design, writing production code, guiding engineering direction, and making foundational technical decisions that shape the future of the platform. You’ll work closely with leadership and product to turn complex healthcare AI requirements into scalable, reliable infrastructure.
If you’ve led engineering efforts for production-grade AI systems and enjoy building high-performance teams and platforms, we’d love to talk.
What You’ll Own
- Architect and evolve the core infrastructure for a scalable AI-powered SaaS platform
- Lead development of backend services and ensure clean integration across systems
- Write high-quality, maintainable code with long-term scalability in mind
- Establish engineering best practices across architecture, CI/CD, monitoring, and DevOps
- Guide and mentor engineers while maintaining a strong individual contributor presence
- Define infrastructure automation strategies (IaC, provisioning, deployment workflows)
- Drive performance, reliability, and observability standards
- Partner with product and leadership to translate vision into executable technical milestones
What We’re Looking For
Core Experience
- 8+ years building and shipping production software
- Prior experience leading engineering initiatives or managing technical direction
- Proven track record scaling a generative AI system or AI-enabled platform in production
- Strong backend engineering background in Python
- Additional proficiency in TypeScript or Java
- Experience with a modern compiled language such as Go or Rust
- Deep knowledge of microservices architecture and RESTful API design
- Hands-on expertise with one major cloud provider (AWS, GCP, or Azure)
- Strong experience with Infrastructure as Code (Terraform preferred)
- Advanced DevOps knowledge: CI/CD, automated deployments, monitoring, alerting
- Experience containerizing and orchestrating services with Docker and Kubernetes
Leadership & Execution
- Comfortable setting technical direction while staying hands-on
- Strong communicator who can explain complex systems clearly
- Experience mentoring engineers and elevating engineering standards
- Ability to operate in a fast-moving startup environment
Nice to Have
- Experience building or deploying ML/AI systems (TensorFlow, PyTorch, etc.)
- Strong GCP background (bonus if you’ve worked across AWS/Azure as well)
- Familiarity with frontend frameworks like React, Angular, or Vue
- Experience working in healthcare, health tech, or regulated industries
Work Environment
This role is based in Palo Alto with a hybrid schedule (3 days per week onsite). We provide daily meals and foster a collaborative, high-performance engineering culture.
Compensation & Benefits
- Base salary range: $220,000 – $280,000, depending on experience and location
- Meaningful equity package
- Medical, dental, and vision coverage
- Flexible hours
- Hybrid work environment
- Opportunity to help shape the future of AI in healthcare
If you're excited about building a production-grade AI platform that directly impacts patient outcomes — and want to help define the technical foundation of a category-defining company — let’s talk.
The Aspen Group (TAG) is one of the largest and most trusted retail healthcare business support organizations in the U.S. and has supported over 20,000 healthcare professionals and team members with close to 1,500 health and wellness offices across 48 states in four distinct categories: dental care, urgent care, medical aesthetics, and animal health. Working in partnership with independent practice owners and clinicians, the team is united by a single purpose: to prove that healthcare can be better and smarter for everyone. TAG provides a comprehensive suite of centralized business support services that power the impact of five consumer-facing businesses: Aspen Dental, ClearChoice Dental Implant Centers, WellNow Urgent Care, Chapter Aesthetic Studio, and Lovet Pet Health Care. Each brand has access to a deep community of experts, tools and resources to grow their practices, and an unwavering commitment to delivering high-quality consumer healthcare experiences at scale.
As a Senior Site Reliability Engineer (SRE) at TAG – The Aspen Group, you will be responsible for ensuring the reliability, performance, and scalability of our core systems. This role involves proactively building and managing monitoring solutions, leading incident response, and continuously optimizing system performance to exceed business objectives. We are actively integrating AI and machine learning into our operational workflows, and you will be on the front lines, leveraging intelligent automation and machine learning to build a proactive, resilient infrastructure. This is an opportunity to go beyond SRE by applying cutting-edge technology to solve complex reliability challenges.
Responsibilities:
Intelligent Site Reliability Engineering:
- Design and build highly scalable and resilient systems to support our applications and services, incorporating predictive analytics to anticipate reliability risks.
- Develop and manage Service Level Objectives (SLOs) and Service Level Indicators (SLIs) using machine learning anomaly detection to ensure systems meet reliability targets.
- Drive improvements in system reliability, availability, and performance through proactive measures, automation, and intelligent failure prediction.
Advanced Observability:
- Implement and manage comprehensive monitoring and alerting solutions, integrating with intelligent observability platforms that reduce alert noise and correlate events.
- Develop and maintain dashboards and reporting tools that provide data-driven insights for actionable troubleshooting recommendations and performance optimization.
- Evaluate and integrate advanced monitoring tools and operational intelligence platforms to enhance observability and root cause identification.
Proactive Incident Management:
- Lead and participate in incident response efforts, using intelligent log analysis and automated event correlation to speed up troubleshooting and root cause identification.
- Develop and maintain incident management processes incorporating automated decision support systems to improve response times and minimize service disruptions.
- Conduct post-incident reviews, using automated pattern recognition and trend analysis to identify systemic issues and implement preventive measures.
Performance and Capacity Optimization:
- Analyze performance metrics and logs, supported by advanced observability tools, to detect bottlenecks and inefficiencies.
- Collaborate with development teams to implement automated profiling and optimization recommendations for code and infrastructure improvements.
- Perform capacity planning using machine learning forecasting models to ensure systems can handle current and future loads.
Automation and Process Improvement:
- Develop and implement automation solutions, including intelligent runbook automation, self-healing systems, and automated incident triage.
- Identify and drive process improvements by applying machine learning to operational data for continuous optimization.
- Maintain documentation that includes automation and machine learning guidelines for monitoring, incident management, and SRE best practices.
Collaboration and Communication:
- Work closely with engineering, operations, and product teams to align reliability and monitoring goals, including automation adoption strategies.
- Communicate effectively with stakeholders, providing regular updates on system health, incidents, performance improvements, and data-driven insights.
- Foster a culture of collaboration, knowledge sharing, and automation best practices within the team and across the organization.
Requirements:
- Bachelor's degree in computer science or a related technical field.
- At least 5 years of experience in Site Reliability Engineering or a similar role.
- Strong proficiency in at least one programming language such as Python, Go, or C#.
- Demonstrated experience applying machine learning and automation to operational workflows such as monitoring, alerting and incident response.
- Expertise with infrastructure as code tools such as Terraform.
- Proven experience working with and monitoring container environments such as Cloud Run and Kubernetes.
- Hands-on experience working within Azure, AWS, and GCP environments (GCP preferred).
- Strong understanding of networking, distributed systems, and cloud infrastructure.
- Familiarity with intelligent monitoring platforms and operational analytics tools such as Prometheus, Grafana, OpenSearch, Sentry, and Google Cloud Observability.
- Excellent problem-solving skills and the ability to work independently and as part of a team.
- Experience with incident management, root cause analysis, and automated operational workflows.
Annual pay range: $129,000-$160,000
A generous benefits package that includes paid time off, health, dental, vision, and 401(k) savings plan with match
Our Ideal Candidate
We are seeking a Senior Backend Engineer with 5+ years’ experience building scalable, secure systems using .NET, cloud services, and modern API frameworks. You should have expertise in high-performance backend architecture, database management (both relational and non-relational), and HIPAA-compliant data handling in regulated industries. Proficiency in AWS, serverless development, CI/CD automation, and cross-team collaboration is required. Experience with microservices, containerization, IaC, or healthcare data standards is a plus.
Responsibilities
- Design, develop, and maintain RESTful APIs and backend services using .NET and related technologies.
- Architect scalable and secure server-side applications to support healthcare data workflows.
- Manage relational and non-relational databases to ensure data integrity and performance.
- Collaborate with front-end developers, DevOps, and QA teams to deliver integrated solutions.
- Implement authentication, authorization, and data protection mechanisms in compliance with HIPAA.
- Optimize backend performance and resource usage across cloud platforms (AWS, Azure).
- Automate backend processes and contribute to CI/CD pipelines.
- Maintain documentation for backend architecture, APIs, and data models.
Qualifications
- B.S. Computer Science degree or equivalent experience
- 5+ years of experience in backend development, preferably in healthcare or regulated industries.
- Proficiency in .NET (C#), ASP.NET Core, and server-side frameworks.
- Strong understanding of database systems (SQL Server, PostgreSQL, MongoDB).
- Experience with cloud platforms such as AWS or Azure.
- Familiarity with scripting languages (Python, PowerShell, Bash).
- Knowledge of networking, security, and system administration principles.
- Excellent problem-solving and communication skills.
Preferred Qualifications
- Experience with healthcare data standards and HIPAA compliance.
- Exposure to microservices architecture and containerization (Docker, Kubernetes).
- Experience with infrastructure as code tools (Terraform, CloudFormation).
- Certifications such as Microsoft Certified: Azure Developer Associate or AWS Developer Associate.
Our Ideal Candidate
We are seeking an experienced cloud and DevOps engineer with over 5 years of experience designing, automating, and maintaining scalable AWS infrastructure, CI/CD pipelines, and secure cloud environments. In the role of Senior Cloud Platform Engineer, you should demonstrate expertise in Infrastructure as Code, scripting, containerization, and modern monitoring or alerting platforms, as well as strong skills working across teams. Success in this position requires a talent for optimizing cloud resources, ensuring security and compliance, and facilitating fast, reliable software deployments. Having experience with HIPAA-compliant systems, .NET platforms, or serverless computing is considered a significant advantage.
Responsibilities
- Design, implement, and maintain CI/CD pipelines using tools like AWS CDK, AWS CodePipeline, or GitHub Actions.
- Manage infrastructure as code (IaC) using Terraform, CloudFormation, or similar tools.
- Monitor system performance and availability using tools like CloudWatch, Prometheus, Grafana, or Datadog.
- Automate repetitive tasks and deployment processes to improve team efficiency.
- Collaborate with software engineers, QA, and product teams to ensure smooth deployments and rapid iteration.
- Implement and enforce security best practices and compliance across infrastructure and deployment pipelines.
- Identify optimizations to reduce cloud resource usage across AWS accounts.
- Maintain documentation for infrastructure, processes, and compliance requirements.
- Work with multiple teams to implement their deployments using common practices.
- Manage builds and the corresponding documentation.
- Monitor package versions, track EOL dates, and upgrade to keep infrastructure current.
Qualifications
- B.S. Computer Science degree or equivalent experience.
- 5+ years of experience in DevOps, Site Reliability Engineering, or related roles.
- 2+ years of hands-on AWS experience.
- Strong experience with cloud platforms (AWS, Azure, or GCP).
- Proficiency in scripting languages such as Bash, Python, or PowerShell.
- Experience with containerization and orchestration (Docker, Kubernetes).
- Familiarity with monitoring, logging, and alerting tools.
- Solid understanding of networking, security, and system administration.
- Strong communication skills and ability to work cross-functionally.