From our professional operation and maintenance perspective, this incident was entirely avoidable. Presumably the tendering agency had not applied "intelligent operation and maintenance" to system maintenance, or had not even realized its importance.
In the past, traditional centralized monitoring focused only on collecting alarm information. Chaotic alarm events full of redundant information kept teams rushing from one emergency to the next. In the current Internet era in particular, the explosion of data has made traditional technology and management methods far less efficient, and operation and maintenance management has grown steadily more difficult, so system crashes and downtime are to be expected.
Under this pressure, it has become imperative to empower traditional operation and maintenance with artificial intelligence, and AIOps (intelligent operation and maintenance) emerged in response. AIOps applies AI techniques, machine learning in particular, to help operation and maintenance personnel work more efficiently, which reduces labor costs for enterprises and safeguards the business.
This matters most for alarms: the risks brought by system changes may be unavoidable, but earlier warning and faster root-cause localization are possible. Processing and analyzing operation and maintenance data has special requirements: the data volume is large, and the timeliness demands on processing are extremely high. Large amounts of operation and maintenance data must be aggregated, calculated, judged, and compared in a high-speed stream-processing engine before they meet the input requirements of machine learning algorithms. Speed is the defining characteristic of operation and maintenance work: once a fault has gone on for a long time, the analysis loses its value.
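To make the idea of early warning on a fast-moving metric stream concrete, here is a minimal sketch in Python. It is not the method of any particular AIOps product: the class name RollingAnomalyDetector, the window size, and the threshold are all assumptions chosen for illustration. It keeps a short window of recent samples and flags a new sample whose distance from the window mean exceeds a set number of standard deviations.

```python
from collections import deque
from math import sqrt


class RollingAnomalyDetector:
    """Hypothetical detector: flags a sample that deviates strongly
    from the mean of a recent sliding window (rolling z-score)."""

    def __init__(self, window_size=30, threshold=3.0, min_samples=5):
        # deque with maxlen drops the oldest sample automatically
        self.window = deque(maxlen=window_size)
        self.threshold = threshold
        self.min_samples = min_samples

    def observe(self, value):
        """Return True if `value` looks anomalous against the current window."""
        is_anomaly = False
        if len(self.window) >= self.min_samples:
            mean = sum(self.window) / len(self.window)
            variance = sum((x - mean) ** 2 for x in self.window) / len(self.window)
            std = sqrt(variance)
            # Skip the check when the window is perfectly flat (std == 0)
            if std > 0 and abs(value - mean) / std > self.threshold:
                is_anomaly = True
        # The sample joins the window whether or not it was flagged
        self.window.append(value)
        return is_anomaly


detector = RollingAnomalyDetector(window_size=10, threshold=3.0)
latencies_ms = [100, 101, 99, 100, 102, 98, 100, 101, 99, 100, 500]
flags = [detector.observe(v) for v in latencies_ms]
```

Each call does work proportional to the window size only, so the check can run inline as samples arrive instead of in a later batch job. A production system would likely use incremental statistics and handle seasonality, which this sketch ignores.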
Intelligent operation and maintenance is a new digital operation and maintenance capability, and it will be essential to digital transformation. Compared with the traditional model, it can bring improvements in four areas: operation and maintenance data governance, business digitalization risk, operation and maintenance labor cost, and business impact.
In terms of alarms, intelligent operation and maintenance can:
1. Find clues to problems in a more timely and effective way at lower labor cost, improving business support capability;
2. Understand and analyze alarms in depth, improving troubleshooting efficiency;
3. Use human-machine collaboration to establish a continuous improvement mechanism, and guide further intelligent transformation in other areas such as basic metric monitoring and log analysis.
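One simple way to cut through redundant alarm events, as the list above calls for, is to compress repeats of the same alarm into a single group. The Python sketch below is only an illustration under stated assumptions: the alarm fields (host, check, timestamp), the AlarmGroup type, and the 300-second window are hypothetical and do not come from any specific monitoring system.

```python
from dataclasses import dataclass


@dataclass
class AlarmGroup:
    """Hypothetical summary of repeated alarms sharing one fingerprint."""
    key: tuple
    first_seen: float
    last_seen: float
    count: int = 1


def compress_alarms(alarms, window_seconds=300):
    """Merge alarms with the same (host, check) fingerprint when each one
    arrives within `window_seconds` of the previous alarm in its group."""
    groups = []        # all groups, in order of creation
    latest = {}        # fingerprint -> most recent group for that fingerprint
    for alarm in sorted(alarms, key=lambda a: a["timestamp"]):
        key = (alarm["host"], alarm["check"])
        group = latest.get(key)
        if group is not None and alarm["timestamp"] - group.last_seen <= window_seconds:
            # Same problem still firing: extend the existing group
            group.last_seen = alarm["timestamp"]
            group.count += 1
        else:
            # First occurrence, or the gap was long enough to count as a new incident
            group = AlarmGroup(key, alarm["timestamp"], alarm["timestamp"])
            latest[key] = group
            groups.append(group)
    return groups


raw_alarms = [
    {"host": "db01", "check": "cpu", "timestamp": 0},
    {"host": "web01", "check": "disk", "timestamp": 30},
    {"host": "db01", "check": "cpu", "timestamp": 60},
    {"host": "db01", "check": "cpu", "timestamp": 120},
    {"host": "db01", "check": "cpu", "timestamp": 1000},
]
groups = compress_alarms(raw_alarms)
```

Here five raw alarms collapse into three groups, so an operator sees one entry per ongoing problem with a repeat count. Fingerprinting on host and check alone is a deliberately crude choice; a learned model could cluster alarms on richer features such as message text or topology.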
Intelligent operation and maintenance is in full swing. Gartner regards AIOps as the next generation of operation and maintenance and predicts that by 2022, more than 50% of enterprises worldwide will use AIOps in place of traditional IT operation and maintenance management methods.
The spreading epidemic tests not only epidemic prevention and control measures but also smart cities. Amid the tide of enterprise digital transformation, "intelligence" is what operation and maintenance should look like.