You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* improvements to anomalies page
* little improvements
* replace screenshots
* Apply suggestions from code review
Co-authored-by: domalessi <[email protected]>
* tag influence and usage unit
* combine with jonathan's PR
* minor updates and image replacements
---------
Co-authored-by: domalessi <[email protected]>
Copy file name to clipboardExpand all lines: content/en/cloud_cost_management/anomalies.md
+75-22Lines changed: 75 additions & 22 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -3,7 +3,7 @@ title: Anomalies Page
3
3
further_reading:
4
4
- link: "/cloud_cost_management/"
5
5
tag: "Documentation"
6
-
text: "Cloud Cost Management"
6
+
text: "Learn about Cloud Cost Management"
7
7
- link: "/cloud_cost_management/monitors"
8
8
tag: "Monitors"
9
9
text: "Create Cost Monitors"
@@ -13,49 +13,102 @@ further_reading:
13
13
14
14
Datadog Cloud Cost Management (CCM) continuously monitors your environment to detect and prioritize unexpected cost changes, enabling you to share, investigate, and resolve anomalies. Cost anomalies are available for AWS, and do not require any additional setup after CCM is set up.
15
15
16
+
{{< img src="cloud_cost/anomalies/anomalies-overview.png" alt="List of cost anomalies showing service names, usage types, and cost impacts" style="width:80;" >}}
17
+
18
+
A typical workflow could be the following:
19
+
20
+
1.**View** anomalies on the Anomalies tab
21
+
2.**Investigate** using Watchdog Explains to understand what's driving the cost changes
22
+
3.**Share with engineering teams** who can take action by reviewing details, investigating further, or setting up monitoring
23
+
4.**Resolve** anomalies that are expected or not significant
24
+
16
25
## How anomalies are defined
17
26
18
27
Anomalies are significant, unexpected changes that stand out from typical patterns. Datadog automatically identifies anomalies using machine learning techniques that adapt to your specific usage patterns.
19
28
20
-
Our algorithm takes into account seasonality, distinguishing true anomalies from expected fluctuations. For example, if your infrastructure typically scales up every Monday, Datadog recognizes this pattern and does not flag it as a cost anomaly. The algorithm also filters into usage charges only (excludes taxes, credits, refunds, RIFee). This is so that anomalies are based on engineering's usage that can be actioned on.
29
+
To distinguish between true anomalies and expected fluctuations, Datadog's algorithm:
30
+
- Recognizes recurring cost spikes and dips, such as a cost increase every Monday
31
+
- Focuses on engineering usage (excludes taxes, credits, refunds, and Reserved Instance fees)
32
+
- Filters out low-impact anomalies to reduce noise
33
+
34
+
## View cost anomalies
21
35
22
-
To further reduce noise, anomalies with low cost impact are automatically filtered out, ensuring your attention stays focused on the largest anomalies.
36
+
On the [Anomalies tab of the Cloud Cost page in Datadog][1], you can view and filter anomalies:
-**Active**: Anomalies from the last full day of cost data (typically 2-3 days prior).
39
+
-**Past**: Anomalies that lasted more than 7 days or are no longer detected as anomalous. Past anomalies can be useful to report on, but are often less urgent and actionable.
40
+
-**Resolved**: Anomalies that you've marked as resolved with context.
25
41
26
-
## View cost anomalies
42
+
Each anomaly card shows:
43
+
- Service name (`rds`, for example)
44
+
- Usage type
45
+
- Cloud accounts affected
46
+
- Expected vs. actual costs
47
+
- Cost trend graph (past 1 month)
48
+
49
+
Anomalies are sorted by cost impact, with the most significant changes at the top.
27
50
28
-
On the [Anomalies tab of the Cloud Cost page in Datadog][1], you can view the anomalies and filter them to Active, Past, or Resolved:
29
-
-**Active**: An anomaly is **Active** if anomalous costs occurred in the last day (the last full day of cost data, which is often 2-3 days prior).
30
-
-**Past**: If an anomaly lasts more than 7 days, or the algorithm detects that costs are no longer anomalous, the anomaly is moved to the **Past** tab. Past anomalies can be useful to report on, but are often less urgent and actionable.
31
-
-**Resolved**: If you [resolve an anomaly](#resolve-anomalies), the anomaly can be found in the **Resolved** tab.
51
+
## Investigate anomalies
32
52
33
-
Each anomaly explains how much more costs were than expected for the service name (ex:'rds'), usage type, and cloud accounts. Additionally, the anomaly also shows what expected costs would have been in the time frame. The anomaly card shows a graph with the cost trend over the past 1 month.
53
+
### Understand what drives anomalies
34
54
35
-
Anomalies with the most unexpected costs are at the top, so that it is easier to take action on anomalies with the most impact first.
55
+
CCM automatically uses [Watchdog Explains][2], an investigation assistant, to help you identify what is driving cost anomalies. Watchdog Explains analyzes and identifies the specific:
36
56
37
-
This is an example of the list of anomalies detected in your infrastructure:
where the anomaly happened, reducing manual investigation steps. When hovering over the anomaly graph, you can see two graphs: one with and one without the tags identified by Watchdog Explains. This shows how removing specific tags flattens the spike, confirming the impact on the cost.
40
64
41
-
## Take action on anomalies
65
+
###Take action on anomalies
42
66
43
-
Click an anomaly to view the services, teams, environments, and resource IDs that may be driving the cost anomaly.
67
+
Follow these steps to investigate and resolve anomalies:
44
68
45
-
Investigate the anomaly further, and by any additional dimensions, by viewing the costs in Explorer or saving the query to a Notebook. You can also send the anomaly, Explorer link, or Notebook to the associated service owners or teams. This enables teams to provide context for why the anomaly occurred, and if it's expected.
69
+
1.**Hover** over an anomalyto see anomaly drivers or click **See more**to open the side panel.
46
70
47
-
You can also create a cost anomaly monitor to get alerted of similar cost anomalies in the future.
71
+
{{< img src="cloud_cost/anomalies/anomalies-watchdog.png" alt="Click See More to see side panel showing anomaly details, investigation options, and action buttons" style="width:80;" >}}
48
72
49
-
This is the side panel where you can take action on your cost anomaly:
73
+
1.**Review the details** for services affected, teams involved, environments impacted, resource IDs, or how usage and unit price may be driving the cost anomaly.
74
+
1.**Investigate further**: View the anomaly in Cost Explorer or a Datadog Notebook to further investigate anomalies by using additional dimensions. You can then send the anomaly, Explorer link, or Notebook to the service owners or teams identified by Watchdog Explains. This enables teams to resolve anomalies with context for why the anomaly occurred and whether it's expected.
50
75
51
-
{{< img src="cloud_cost/anomalies/cost_anomalies_side-panel.png" alt="In the side panel you can take action on your cost anomaly" style="width:90%;" >}}
76
+
{{< img src="cloud_cost/anomalies/anomalies-take-action.png" alt="Click Take Action to view the anomaly in Cost Explorer or add it to a Notebook" style="width:80;" >}}
77
+
1.**Set up monitoring**: Create a cost anomaly monitor for similar patterns or configure alerts for future anomalies.
78
+
{{< img src="cloud_cost/anomalies/anomalies-create-monitor.png" alt="Create a cost anomaly monitor" style="width:80;" >}}
52
79
53
80
## Resolve anomalies
54
81
55
-
As you investigate anomalies, you may find anomalies that are not significant, were actually expected costs, or are otherwise not considered an anomaly.
82
+
As you investigate anomalies, you may find some that are not significant, were actually expected costs, or are otherwise not considered anomalies.
56
83
57
-
Mark anomalies as significant or insignificant to give feedback and help improve the anomaly detection algorithm. Resolve anomalies with context to move anomalies to the Resolved tab, and add context for others in your organization.
84
+
To resolve an anomaly:
85
+
86
+
1. Click **Resolve Anomaly** to open the resolution popup.
87
+
1. Select one of the following resolutions to help improve the algorithm:
88
+
- The anomaly amount was too small
89
+
- This is an unexpected increase
90
+
- This is an expected increase
91
+
1.**Add context** about why it is or is not an anomaly.
92
+
1. Click **Resolve** to move it to the Resolved tab.
58
93
59
94
This is an example of how to mark a cost anomaly as significant and explain why it's an anomaly:
60
95
61
-
{{< img src="cloud_cost/anomalies/cost_anomalies_side-panel_is-significant.png" alt="You can mark cost anomaly as significant and write an explanation about why it's an anomaly and then resolve it" style="width:90%;" >}}
96
+
{{< img src="cloud_cost/anomalies/cost_anomalies_side-panel_is-unexpected-1.png" alt="Form for marking an anomaly as unexpected with explanation field" style="width:80;" >}}
97
+
98
+
## Troubleshooting
99
+
100
+
If you're not seeing expected anomalies:
101
+
- Verify that CCM is [properly set up][3]
102
+
- Check that you have the necessary permissions for AWS
0 commit comments