Basic Computing Services (subMIT) Review
Thursday 22 May 2025, 13:00
→
15:35
America/New_York
MIT
MIT
Description
Zoom connection available at
https://mit.zoom.us/j/96743699673?pwd=b3h2Q3c3cVQwYW12blhMUG5SWXZCZz09
13:00
→
13:10
Opening Remarks from the Steering Committee
10m
Speaker
:
Christoph Paus
(MIT)
13:10
→
13:20
Overview: The purpose and impact of SubMIT
10m
What is the problem we are trying to solve
System usage: total and weekly users, by department etc. …
Web page, Paper on SubMIT, Publications with SubMIT, …
Speaker
:
David Walter
13:20
→
13:30
User Workflows on SubMIT
10m
Account creation and login
Access through JupyterHub or terminal
Conda, Containers, singularity
Batch computing using slurm, htcondor
External resources and how to access them
Speaker
:
Luca Lavezzo
(MIt)
13:30
→
13:40
Hardware resources and performance
10m
Hardware resources, compute, network, ...
status, capacity, usage, ...
What resources make SubMIT attractive
Benchmarking of the system, analysis challenge
Speaker
:
Mariarosaria D'Alfonso
(Massachusetts Institute of Technology)
13:40
→
13:50
User support
10m
Communication channels: Stack, email, …
Chatbot
User's guide
Emails to tickets analysis
Speaker
:
Marianne Moore
(MIT)
13:50
→
14:00
Break
10m
14:00
→
14:10
Engagement with the user community
10m
SubMIT workshop, tutorials, user meetings
Classroom usage, workshops hold at MIT using SubMIT resources
How the community evolves/grows
Speaker
:
Matthew Heine
(Massachusetts Institute of Technology)
14:10
→
14:20
Customization and user requests
10m
Specific user requests:
OpenMPI, Mathematica, Globus
Groups: priority access on purchased hardware, storages, webpage
Dropbox like storage? We didn’t follow up on that
Current limitations and open challenges
Balancing restrictions/rules with fair share usage
Speaker
:
Xuejian(Jacob) Shen
(Massachusetts Institute of Technology)
14:20
→
14:30
Stable long term operations & outlook
10m
What did we learn the last year(s)
Software upgrades policy
The future of SubMIT
Control groups (partially done): Limit abuse of the system, include CephFS machines in Slurm pool, …
Take control over LDAP server (This will allow us to…)
Removal of old data (/ceph)
Speaker
:
Zhangqier Wang
(Massachusetts Institute of Technology)
14:30
→
14:40
Discussion & feedback
10m