The world is slowly moving to DevOps. Today, on the OMS side, whoever implements the product for a customer sells it by showing ready-to-deploy code and development accelerators that help the customer go to market faster with changes,
but these focus only on the development side. There is absolutely nothing in the market that strengthens the ops side so that it is ready to deploy from Day 1.
Why doesn't IBM create such a product? It would be an additional product (licensed separately or shipped along with OMS) that focuses mainly on monitoring and strengthens customer ops by monitoring the product, the business, and the infrastructure.
Adding a new parameter to monitor in this product should be plug and play. IBM can also consider a mobile-first approach here.
Today, most business monitoring is done using data from the OMS DB. Sometimes even infra parameters, such as the health of agents, app servers, and integration servers and the load on the JVMs, are measured via DB metrics that are updated by the health monitors/APIs themselves.
Most of the time, on the vendor/customer side, this is done through cron. A cron job fires a DB query, the data is prepared as an xls (or any other format), and it is sent to the Ops group to check further.
The idea here is:
IBM can develop a product, either as a mobile app, a dedicated monitoring page in SST (for OMOC), or a standalone product, that provides the following:
-> Ops can configure a custom DB query that needs to run at every X interval.
-> Maintain a history of the data, e.g. the last 4 or 5 DB sweeps with timestamps.
-> Report an anomaly if the data does not grow as expected or does not meet the minimum baseline.
Ex: Order volume should grow by X every hour. If not, report an anomaly.
Ex: The schedule agent should process at least X orders in the last hour. If not, report an anomaly.
Ex: Backorder volume in the last 15 minutes is higher than the slab. Report an anomaly.
Ex: Run a query, and if even one record is found, treat it as an anomaly and send a report.
The volume that the product monitors against should be defined at an hourly (or some other interval) level. This is to account for off-peak and peak volumes.
-> Ops should be able to delete any monitor or temporarily switch off a specific monitor.
-> IBM can move the infra monitoring that is currently shown in SST to this product. For Ops, it becomes a one-stop solution where they get everything.
-> I presented this idea with a prototype in my organisation. I developed it as a mobile app in Flutter. I have attached a video of the app; please go through it and let me know if you need further information.
-> Developing this ops-side monitoring tool as an app fits today's needs. Instead of emails, a push notification is sent to the team, so the anomaly reaches the team at the right time. After all, time is crucial in Ops.
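
To make the monitoring rules above more concrete, here is a rough sketch in Python of how one DB sweep could be evaluated. It is only an illustration, not the prototype's actual code: the `Monitor` class, the `sweep` function, the rule names (`min_growth`, `max_value`, `any_row`), and the `orders` table are all hypothetical, and an in-memory SQLite database stands in for the OMS DB.

```python
import sqlite3
from collections import deque
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Deque, Optional, Tuple


@dataclass
class Monitor:
    """One configured monitor (all names here are hypothetical)."""
    name: str
    query: str              # custom SQL that returns a single number
    rule: str               # "min_growth", "max_value", or "any_row"
    threshold: float = 0.0  # baseline/slab for the current interval
    enabled: bool = True    # Ops can switch a monitor off temporarily
    # Keep only the last 5 sweeps, each stored with its timestamp.
    history: Deque[Tuple[datetime, float]] = field(
        default_factory=lambda: deque(maxlen=5))


def sweep(conn: sqlite3.Connection, monitor: Monitor) -> Optional[str]:
    """Run one sweep; return an anomaly message, or None if nothing is wrong."""
    if not monitor.enabled:
        return None
    row = conn.execute(monitor.query).fetchone()
    value = float(row[0]) if row and row[0] is not None else 0.0
    previous = monitor.history[-1][1] if monitor.history else None
    monitor.history.append((datetime.now(timezone.utc), value))

    if monitor.rule == "min_growth":
        # Value must grow by at least `threshold` since the previous sweep.
        if previous is not None and value - previous < monitor.threshold:
            return (f"{monitor.name}: grew by {value - previous:g}, "
                    f"expected at least {monitor.threshold:g}")
    elif monitor.rule == "max_value":
        # Value must not exceed the slab.
        if value > monitor.threshold:
            return f"{monitor.name}: {value:g} is above the slab {monitor.threshold:g}"
    elif monitor.rule == "any_row":
        # Any record found by the query counts as an anomaly.
        if value > 0:
            return f"{monitor.name}: {value:g} record(s) found"
    return None


# Demo with a stand-in table; the real OMS schema is not assumed here.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, status TEXT)")

order_volume = Monitor("order volume", "SELECT COUNT(*) FROM orders",
                       "min_growth", threshold=3)
backorders = Monitor(
    "backorders",
    "SELECT COUNT(*) FROM orders WHERE status = 'BACKORDERED'",
    "max_value", threshold=1)

conn.executemany("INSERT INTO orders (status) VALUES (?)", [("CREATED",)] * 4)
first = sweep(conn, order_volume)    # first sweep, nothing to compare against

conn.executemany("INSERT INTO orders (status) VALUES (?)", [("BACKORDERED",)] * 2)
second = sweep(conn, order_volume)   # grew by 2, below the baseline of 3
third = sweep(conn, backorders)      # 2 backorders, above the slab of 1

backorders.enabled = False           # Ops switches the monitor off
fourth = sweep(conn, backorders)     # disabled monitors are skipped

for message in (first, second, third, fourth):
    if message:
        print("ANOMALY:", message)
```

In a real product the `threshold` would probably be looked up per hour (or per interval) to account for peak and off-peak volume, and the `print` call would be replaced by a push notification to the Ops team.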