Job Summary:
The University of Wisconsin-Madison School of Medicine and Public Health (SMPH) is embarking on an exciting mission of establishing state of the art computational, data, and informatics infrastructure for supporting cutting edge biomedical research and innovation to care. We are seeking an HPC DevOps Engineer to join our Informatics team and help us design, develop, and deploy complex research computing and computational science workflows that support researchers.
The Wisconsin Research, Innovation and Scholarly Excellence (RISE) Initiative at UW-Madison is dedicated to addressing significant, complex challenges through interdisciplinary collaboration, strategic faculty hiring, and research infrastructure enhancement. The initiative focuses on areas such as AI, Earth sciences, and human-centered technology to drive groundbreaking discovery and education.
The HPC DevOps Engineer will support the research data services offered through SMPH as part of the RISE Initiative by managing and optimizing high-performance computing resources. This position involves collaborating with researchers, faculty, and IT teams to ensure efficient and secure deployment of computational workloads, particularly in AI and data-intensive research projects.
If you are passionate about HPC and eager to contribute to groundbreaking research, we encourage you to apply.
Responsibilities:
Designs, integrates, delivers, and provides day-to-day maintenance to ensure subsystems and infrastructure function together and business needs are met. Provides operational support of applications or services, development operations (DevOps), through automation an/or infrastructure as code. Provides day-to-day maintenance and long-term assistance to ensure subsystems and infrastructure function together and business needs are met.
- 10% Communicates and coordinates with staff related to the project and/or system
- 40% Develops, programs, and/or deploys automation workflows for deployment, configuration, and/or monitoring of systems/services
- 10% Manages and maintains automation tools and infrastructure, including security configurations
- 20% Performs integration, migration, configuration, and security of existing applications and services into automated infrastructures
- 10% Provides general Linux-related troubleshooting and support. This may include troubleshooting supporting tools and services (e.g., ceph file system, firewall).
- 10% Implements and supports security controls to ensure the HPC protects data following HIPAA and relevant NIST guidance.
Institutional Statement on Diversity:
Diversity is a source of strength, creativity, and innovation for UW-Madison. We value the contributions of each person and respect the profound ways their identity, culture, background, experience, status, abilities, and opinion enrich the university community. We commit ourselves to the pursuit of excellence in teaching, research, outreach, and diversity as inextricably linked goals.
The University of Wisconsin-Madison fulfills its public mission by creating a welcoming and inclusive community for people from every background - people who as students, faculty, and staff serve Wisconsin and the world.
For more information on diversity and inclusion on campus, please visit: Diversity and Inclusion
Education:
Preferred Bachelor's Degree Preferably in computer science, data science, informatics, epidemiology, biostatistics, or related disciplines
Qualifications:
Required: - Minimum of 3 years of experience with High-Performance Computing (HPC) or High Throughput Computing (HTC) - Minimum of 2 years professional experience in automation, configuration management, package/patch management, and monitoring using Infrastructure as Code (IaC) tools (e.g., Ansible, Grafana, Prometheus, Terraform) - Proficiency in Linux/Unix systems and scripting languages (e.g., Bash, Python) - Experience with HPC workload management and job scheduling tools (e.g., Slurm, HTCondor) - Experience with HPC storage principles and managing distributed file systems (e.g., Ceph, Lustre, GlusterFS, BeeGFS) - Experience with Git version control system - Excellent oral and written communication skills
Preferred: - Experience with research and healthcare environments - Experience with Agile or Scrum methodology and tools - Experience with Continuous Integration/Continuous Delivery (CI/CD) pipelines - Experience with containers and container orchestration solutions (e.g., Apptainer, Docker, Kubernetes, Podman) - Experience supporting or deploying machine learning (ML/AI) workloads - Experience with NVIDIA data center class GPU hardware and software platforms (e.g., DGX, HGX, NGC) - Experience with regulatory or security regulations, controls, and benchmarks like HIPAA, NIST 800-53 and CIS - Familiarity with low-latency high-throughput network infrastructure (e.g., RoCE, InfiniBand) - Familiarity with shared and distributed memory parallelism (OpenMP, MPI) and GPUs
Strong communication and people skills are required. The ideal candidate is curious, self-motivated, and able to work with minimal supervision. The selected candidate must be committed to providing quality customer service.
Work Type:
Full Time: 100%
This position may require some work to be performed in-person, onsite, at a designated campus work location. Some work may be performed remotely, at an offsite, non-campus work location.
Appointment Type, Duration:
Ongoing/Renewable
Salary:
Minimum $100,000 ANNUAL (12 months) Depending on Qualifications The starting salary for the position is $100,000 but is negotiable based on experience and qualifications.
Employees in this position can expect to receive benefits such as generous vacation, holidays, and sick leave; competitive insurances and savings accounts; retirement benefits. Benefits information can be found at (https://hr.wisc.edu/benefits/).
SMPH Academic Staff Benefits flyer: (https://uwmadison.box.com/s/r50myohfvfd15bqltljn0g4laubuz7t0)
Additional Information:
This position is part of the Wisconsin Research, Innovation and Scholarly Excellence (RISE) Initiative. Through accelerated and strategic faculty hiring, research infrastructure enhancement, interdisciplinary collaboration, and increased student and educational opportunities, RISE addresses complex societal challenges of importance to the state, nation and world. Building on UW-Madison's strengths, RISE expands the University's successful track record of connecting with communities and industry on collaborative solutions.
Over the next three academic years, UW-Madison will substantially increase current research computing personnel and hardware infrastructure. Candidates hired through RISE will join a community of scholars working across disciplines, schools and colleges on research, teaching and outreach endeavors. The RISE community, and extended campus community, will engage regularly in venues such as seminar series and colloquia to share ongoing projects and identify opportunities to work together. The University will support the community, facilitating access to research infrastructure, and funding to support broad and rich collaboration.
Further information regarding RISE can be found at: https://rise.wisc.edu/
University sponsorship is not available for this position, including transfers of sponsorship. The selected applicant will be responsible for ensuring their continuous eligibility to work in the United States (i.e. a citizen or national of the United States, a lawful permanent resident, a foreign national authorized to work in the United States without the need of an employer sponsorship) on or before the effective date of appointment. This position is an ongoing position that will require continuous work eligibility. If you are selected for this position you must provide proof of work authorization and eligibility to work.
How to Apply:
To apply for this position, please click on the "Apply Now" button. You will be asked to upload a current resume/CV and a cover letter briefly describing your qualifications and experience.
Contact:
Cody Roekle croekle@wisc.edu 608-263-7676 Relay Access (WTRS): 7-1-1. See RELAY_SERVICE for further information.
Official Title:
DevOps Engineer II(IT106)
Department(s):
A53-MEDICAL SCHOOL/Informatics and IT
Employment Class:
Academic Staff-Renewable
Job Number:
312770-AS
The University of Wisconsin-Madison is an Equal Opportunity and Affirmative Action Employer.
Qualified applicants will receive consideration for employment without regard to, including but not limited to, race, color, religion, sex, sexual orientation, gender identity, national origin, age, pregnancy, disability, or status as a protected veteran and other bases as defined by federal regulations and UW System policies. We promote excellence through diversity and encourage all qualified individuals to apply. For more information regarding applicant and employee rights and to view federal and state required postings, click here
If you need to request an accommodation because of a disability, you can find information about how to make a request at the following website: https://employeedisabilities.wisc.edu/disability-accommodation-information-for-applicants/
Employment may require a criminal background check. It may also require you and your references to answer questions regarding sexual violence and sexual harassment.
The University of Wisconsin System will not reveal the identities of applicants who request confidentiality in writing, except that the identity of the successful candidate will be released. See Wis. Stat. sec. 19.36(7).
The Annual Security and Fire Safety Report contains current campus safety and disciplinary policies, crime statistics for the previous 3 calendar years, and on-campus student housing fire safety policies and fire statistics for the previous 3 calendar years. UW-Madison will provide a paper copy upon request; please contact the University of Wisconsin Police Department.
|