Oracle recently released a new version of its Autonomous Health Framework (AHF). At the time of writing the version is 22.3.1.
As the development is quite active there, one of the new features is AHF Insights with a nice description:
AHF Insights provides deeper diagnostic insights into Oracle diagnostic collections collected by AHF diagnostic utilities, Oracle EXAchk, Oracle Trace File Analyzer, Exawatcher, and Cluster Health Monitor.
This sounds promising, so I was tempted to give it a try!
My first attempt failed: I gave it a chance at my sandbox in my laptop, but right now AHF Insights only work on Engineered Systems. I didn't found this documented somewhere - but ok. The good news: I was told AHF Insights will be available in future versions also for non-Engineered environments. As this is the first release, I can follow the argument here.
So I had to grab an Engineered System machine, install AHF, let it run for some time and then just run
ahf analysis create --type insights
.
└── web
├── css
│ ├── alta
│ │ └── 11.1.1
│ │ └── web
│ │ ├── fonts
│ │ └── images
│ │ └── sprites
│ ├── fonts
│ ├── images
│ └── redwood
│ └── 11.1.1
│ ├── common
│ └── web
│ ├── fonts
│ └── images
├── dynamicHtml
├── icons
├── js
│ ├── jsons
│ │ ├── insights
│ │ ├── meta
│ │ └── topology
│ ├── libs
│ │ ├── oj
│ │ │ └── v11.1.1
│ │ │ └── resources
│ │ │ └── root
│ │ └── require
│ ├── viewModels
│ └── views
│ ├── compliance
│ │ └── js
│ │ └── input
│ ├── dbParameter
│ │ └── js
│ │ └── input
│ ├── ganttPage
│ │ └── js
│ │ └── input
│ ├── os
│ │ └── js
│ │ └── input
│ └── rpm
│ └── js
│ └── input
└── log
- the AHF version
- the time range it covers
- System Topology
- Insights it found
The Cluster information contains basic details about the cluster itself, it's resources and the ASM disks.
The summary information has anCopy as text button which generates a nice output:
========================================================================================================================
Area | Dimension | Value
========================================================================================================================
System | Type | Exadata
System | Total Node Count | 2
Cluster | Node Count | 1
Cluster | GI Version | 19.17.0.0.0
Cluster | Timezone | Europe/
Cluster | Cluster Name |
Cluster | CRS Home | /u01/app/19.0.0.0/grid
Databases | Count | 1
Databases | Database Home | /u02/
Database Servers | Count | 2
Database Servers | Hardware Model | Standard PC (Q35 + ICH9, 2009)
Database Servers | Image Version | 21.2.13.0.0.220602
Database Servers | Operating System | Linux x86_64
Database Servers | Operating System Version | 4.14.35-2047.511.5.5.1.el7uek.x86_64
========================================================================================================================
The Timeline gives a graph of all relevant events, in my case there are only 2 types of events on one hostname. Below there are filters which can be used if many events appear on the timeline. Also a list of Timestamp, Type, Event, Hostname and Description of each (filtered) event is shown.
If you move the mouse to the right top of the chart, a list of icons becomes visible for an interactive interaction with the graph:
This can be used in all charts in the Insights.
In the Operating System Issues Page, a lot of detailed information about CPU, Memory, IO, Network, and Processes is shown, both as Charts in Metrics and Grouped Details in the Reports section.
A nice detail in Process Aggregation is the aggregation on instance basis - so I can check the aggregated memory, processes, etc. for each instance.
In the Best Practices a list of all the checks, their findings and recommendations is given.
I always be careful with "Best Practices" - in this particular case the underscore parameters were given by Oracle support in a specific service request.
But of course, you can see them as recommendations, either follow them or at least document the specific reason you did NOT follow them for a given purpose.
In the last section, Recommended Software, you can see the early stage of the software.
For me, 19.17.0.0.0 is close enough to 19.17.0.0.221018.
Maybe in a future release, Oracle will agree ;)
At all, I can highly recommend to use AHF Insights whenever you need to investigate an issue which might have a wider scope than one single SQL or instance. Even as a first release, it can provide a lot of useful information in a nice and well structured report.