Kubernetes, Kubeflow, Docker, Podman, Singularity, AWS ParallelCluster administration and management, Slurm for complex workflows and resource management, software stack building, application optimization, PyTorch, Horovod, Microsoft DeepSpeed, Ray, NVIDIA RAPIDS, CUDA programming, profiling and debugging with Arm Forge, Cray tools, Dask, high-performance data processing, Cray systems (XC40, XC30), HPE Cray EX, IBM Blue Gene, Python, C/C++, Bash scripting, MPI, OpenMP, OpenACC, high-performance interconnects, cluster networking