


Discovery of Cloud Incidents through Streaming Consolidation of Events across Timeline and Topology Hierarchy
Indexed in
WoS WOS:001270140300066
Scopus SCOPUS_ID:85198401121
DOI 10.1109/NOMS59830.2024.10575213
Year 2024
Type proceedings paper


Abstract



With the growing complexity and dynamism of cloud environments, users of operations management solutions face a critical problem: "event storms". Understanding and prioritizing reactions to such high volumes of noisy recommendation content for various tasks is beyond the capacity of human operators. This significantly degrades the resolution metrics for performance issues and the optimization of infrastructures and applications. We have devised a novel streaming clustering algorithm for processing alerts and discovering Alert Episodes, with their evolution tracked in time and space. It is based on the principles of the classical density-based clustering algorithm DBSCAN. We learn Unknown Problems by applying this algorithm to low-level events within the VMware Aria Operations manager. Those episodes might typically fall outside the coverage of alert definitions and explain new types of emerging incidents. Our solutions, with different hyperparameters, are prototyped and integrated into production. We share experimental insights from an internal environment, including interesting alert episodes learned and unknown problems of alarms/symptoms discovered, each with a self-explanatory story of where the source of the performance issue lies and how it evolved into a larger problem affecting several objects and hierarchy layers. The constructs we introduce help reduce the effort users spend making sense of waves of events and let them troubleshoot with relevance. Our solution can be refactored into an independent event management service for cloud operations.
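
The abstract does not spell out the algorithm, so the following is only a rough sketch of the general idea it describes: grouping streamed events into episodes when they are close in both time and topology. It is not the authors' method. All names (`Event`, `Episode`, `EpisodeTracker`, `eps_time`, `max_hops`, `min_events`) are hypothetical, the topology is assumed to be a tree given as a child-to-parent map, and the linking rule is a simplified incremental variant of density-based clustering rather than DBSCAN itself.

```python
from dataclasses import dataclass, field


@dataclass
class Event:
    timestamp: float  # seconds since some reference point
    object_id: str    # topology node that emitted the event
    name: str


@dataclass(eq=False)  # compare episodes by identity, not by content
class Episode:
    events: list = field(default_factory=list)

    @property
    def objects(self):
        return {e.object_id for e in self.events}


def hops(a, b, parent):
    """Tree distance between nodes a and b, given a child -> parent map."""
    def chain(node):
        path = [node]
        while node in parent:  # assumes the map has no cycles
            node = parent[node]
            path.append(node)
        return path

    depth_b = {node: i for i, node in enumerate(chain(b))}
    for i, node in enumerate(chain(a)):
        if node in depth_b:
            return i + depth_b[node]
    return float("inf")  # no common ancestor


class EpisodeTracker:
    """Hypothetical streaming tracker: an event joins every episode that
    already holds an event within eps_time seconds and max_hops topology
    hops; episodes linked by the same event are merged."""

    def __init__(self, parent, eps_time=300.0, max_hops=1, min_events=3):
        self.parent = parent
        self.eps_time = eps_time
        self.max_hops = max_hops
        self.min_events = min_events
        self.open = []

    def _near(self, a, b):
        return (abs(a.timestamp - b.timestamp) <= self.eps_time
                and hops(a.object_id, b.object_id, self.parent) <= self.max_hops)

    def add(self, event):
        linked = [ep for ep in self.open
                  if any(self._near(event, e) for e in ep.events)]
        merged = Episode([event])
        for ep in linked:
            merged.events.extend(ep.events)
            self.open.remove(ep)
        merged.events.sort(key=lambda e: e.timestamp)
        self.open.append(merged)
        return merged

    def episodes(self):
        # Only groups dense enough to count as an episode are reported.
        return [ep for ep in self.open if len(ep.events) >= self.min_events]


# Made-up topology and events, purely for illustration.
topology = {"vm1": "host1", "vm2": "host1", "host1": "cluster1",
            "vm9": "host9", "host9": "cluster2"}
tracker = EpisodeTracker(topology)
for ev in [Event(0, "vm1", "cpu_contention"), Event(60, "host1", "cpu_high"),
           Event(120, "vm2", "latency_high"), Event(130, "vm9", "disk_full")]:
    tracker.add(ev)

for ep in tracker.episodes():
    print(sorted(ep.objects))
```

In this toy run the three events on `vm1`, `host1`, and `vm2` chain into one episode that spans two hierarchy layers, while the unrelated `vm9` event stays a singleton and is filtered out by `min_events`. A production variant would also need to expire old episodes and bound memory, which this sketch leaves out.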


Research Disciplines

WOS
No disciplines
Scopus
No disciplines
SciELO
No disciplines

Shows the distribution of disciplines for this publication.

Publications: WoS (editions: ISSHP, ISTP, AHCI, SSCI, SCI), Scopus, SciELO Chile.



Authors - Affiliation

No. Author Gender Institution - Country
1 Harutyunyan, Ashot - Yerevan State Univ - Armenia
NAS RA - Armenia
Yerevan State University - Armenia
2 Poghosyan, Arnak - NAS RA - Armenia
Institute of Mathematics of the National Academy of Sciences of the Republic of Armenia - Armenia
3 Bunarjyan, Tigran - TECH UNIV MUNICH - Germany
Technische Universität München - Germany
4 Grigoryan, Naira - VMware - Armenia
VMware, Inc - United States
5 Grigoryan, Artur - VMware - Armenia
VMware, Inc - United States
6 Tadevosyan, Vahan - VMware - Armenia
VMware, Inc - United States
7 Baloian, Nelson - Universidad de Chile - Chile
8 Hong, JWK -
9 Seok, SJ -
10 Nomura, Y -
11 Wang, YC -
12 Choi, BY -
13 Kim, MS -
14 Riggio, R -
15 Tsai, MH -
16 DosSantos, CRP -

Shows the affiliation and (detected) gender of the publication's co-authors.

Funding

Source
No information

Shows the funding source declared in the publication.

Acknowledgments

Acknowledgment
No information
