Optically Interconnected Disaggregated Datacenters in Support of ML/AI Applications: a Failure Analysis (W2A.43)
Presenter: Albert Pagès, Universitat Politècnica de Catalunya
Disaggregated datacenters are promising solutions for executing ML applications. One crucial aspect is the application resilience against infrastructure failures. We analyze application affectation and disruption rates in front of various failure patterns.
Authors:Albert Pagès, Universitat Politècnica de Catalunya / Fernando Agraz, Universitat Politècnica de Catalunya / Salvatore Spadaro, Universitat Politècnica de Catalunya