Your browser does not support javascript! Please enable it, otherwise web will not work for you.

HPC Software Development Professional @ Hewlett Packard

Home > Devops






 HPC Software Development Professional

Job Description

HPC Software Development
This role has been designed as Onsite with an expectation that you will primarily work from an HPE office.
Who We Are:
Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today s complex world. Our culture thrives on finding new and better ways to accelerate what s next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.
Job Description:
High Performance Computing, AI and Labs is a critical element of HPE. We are focused on delivering innovative solutions that accelerate our customers digital transformation, enabling them to tackle their complex, and data-intensive workloads. Combining deep expertise and the development of the world s most cutting-edge, high-performance supercomputers, is defining the next era of computing delivering valuable insight innovation. Join us and redefine what s next for you.
What youll do:
You are a dynamic, driven professional with a passion for success yours, your company s and your customer s. You bring knowledge and expertise in high performance computing, cloud computing or related technical fields, strong communication and collaboration skills, and you always conduct yourself with the highest professionalism and integrity.
Responsibilities:
  • Install, configure, and maintain HPC systems , including compute, storage, and management nodes.
  • Manage HPC lab environments , ensuring availability and reliability for software development, testing, and validation activities.
  • Perform software installations, upgrades, and patch management for HPC environments.
  • Troubleshoot and resolve hardware and system-level issues in collaboration with vendors and internal teams.
  • Develop and maintain automation frameworks for system installation, configuration, and monitoring using Python , Ansible , or similar tools.
  • Monitor system performance, analyze logs, and proactively identify and resolve performance bottlenecks.
  • Implement best practices for system security, backup, and recovery .
  • Maintain system documentation, including architecture diagrams, configuration details, and operational procedures.
  • Collaborate with cross-functional teams to support ongoing HPC projects and system upgrades.
  • Contribute to continuous improvement initiatives in system reliability, scalability, and automation.
What you need to bring:
  • Bachelors or Masters engineering degree in Computer Science, Information Systems.
  • Typically 10+ years experience.
  • Programming experience with scripting ( Python or Bash )
  • C/C++ experience
  • Strong experience with Linux/Unix system administration (Red Hat, SUSE, or similar).
  • Knowledge of HPC architectures , cluster management, and resource scheduling tools (e.g., Slurm, PBS, LSF).
  • Experience in system provisioning and configuration automation using Python , Ansible , or similar frameworks.
  • Proficiency in hardware troubleshooting , including node, interconnect, and storage components.
  • Familiarity with storage management systems (e.g., NFS, Lustre, GPFS).
  • Understanding of networking fundamentals (TCP/IP, DNS, VLANs).
  • Experience with system monitoring tools such as Prometheus, Grafana, or Nagios.
  • Ability to develop and maintain automated workflows for system deployment and management.
  • Familiarity with CI/CD concepts and infrastructure as code (IaC) practices is a plus.
  • Technical curiosity, willingness to learn new technologies
  • Strong communication skills when talking about technical concepts
  • Able to collaborate and communicate well with colleagues across geographies
Nice to have :
  • Parallel Programming - thread safe programming
  • Docker Kubernetes
  • VM experience
  • Networking experience (Infiniband / Slingshot / Ethernet / etc)
  • SQL knowledge
  • Agile/Scrum experience
  • AI development tools (such as Github copilot)
 

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employement Type: Full time

Contact Details:

Company: Hewlett Packard
Location(s): Bengaluru

+ View Contactajax loader


Keyskills:   Unix TCP Patch management Cloud computing C++ Claims Networking Linux Ethernet SQL

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Senior Systems Engineer - DevOps and Development

  • CGI
  • 4 - 7 years
  • Bengaluru
  • 4 days ago
₹ Not Disclosed

Lead Software Engineer, Cloud Site Reliability (SRE)

  • Icertis
  • 7 - 12 years
  • Pune
  • 5 days ago
₹ Not Disclosed

Software Engineer, Cloud Site Reliability (SRE)

  • Icertis
  • 6 - 11 years
  • Pune
  • 7 days ago
₹ Not Disclosed

Professional Services Delivery Engineer Iii, Specialty Domain - In

  • Rackspace Technology
  • 5 - 7 years
  • Noida, Gurugram
  • 8 days ago
₹ Not Disclosed

Hewlett Packard

Hewlett Packard Enterprise is an industry leading Technology Company that enables customers to go further, faster. With the industry’s most comprehensive portfolio, spanning the cloud to the data center to workplace applications, our technology and services help customers around the world mak...