Commit 54f377f

rtrieu and domalessi authored and committed
CCM Anomalies updates (#29624)
* improvements to anomalies page
* little improvements
* replace screenshots
* Apply suggestions from code review

Co-authored-by: domalessi <[email protected]>

* tag influence and usage unit
* combine with jonathan's PR
* minor updates and image replacements

---------

Co-authored-by: domalessi <[email protected]>
1 parent 8218228 commit 54f377f

9 files changed: +75 -22 lines changed

content/en/cloud_cost_management/anomalies.md

Lines changed: 75 additions & 22 deletions
@@ -3,7 +3,7 @@ title: Anomalies Page
33
further_reading:
44
- link: "/cloud_cost_management/"
55
tag: "Documentation"
6-
text: "Cloud Cost Management"
6+
text: "Learn about Cloud Cost Management"
77
- link: "/cloud_cost_management/monitors"
88
tag: "Monitors"
99
text: "Create Cost Monitors"
@@ -13,49 +13,102 @@ further_reading:
1313

1414
Datadog Cloud Cost Management (CCM) continuously monitors your environment to detect and prioritize unexpected cost changes, enabling you to share, investigate, and resolve anomalies. Cost anomalies are available for AWS, and do not require any additional setup after CCM is set up.
1515

16+
{{< img src="cloud_cost/anomalies/anomalies-overview.png" alt="List of cost anomalies showing service names, usage types, and cost impacts" style="width:80%;" >}}
17+
18+
A typical workflow could be the following:
19+
20+
1. **View** anomalies on the Anomalies tab
21+
2. **Investigate** using Watchdog Explains to understand what's driving the cost changes
22+
3. **Share with engineering teams** who can take action by reviewing details, investigating further, or setting up monitoring
23+
4. **Resolve** anomalies that are expected or not significant
24+
1625
## How anomalies are defined
1726

1827
Anomalies are significant, unexpected changes that stand out from typical patterns. Datadog automatically identifies anomalies using machine learning techniques that adapt to your specific usage patterns.
1928

20-
Our algorithm takes into account seasonality, distinguishing true anomalies from expected fluctuations. For example, if your infrastructure typically scales up every Monday, Datadog recognizes this pattern and does not flag it as a cost anomaly. The algorithm also filters into usage charges only (excludes taxes, credits, refunds, RIFee). This is so that anomalies are based on engineering's usage that can be actioned on.
29+
To distinguish between true anomalies and expected fluctuations, Datadog's algorithm:
30+
- Recognizes recurring cost spikes and dips, such as a cost increase every Monday
31+
- Focuses on engineering usage (excludes taxes, credits, refunds, and Reserved Instance fees)
32+
- Filters out low-impact anomalies to reduce noise
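
The three criteria above can be illustrated with a minimal Python sketch. This is not Datadog's algorithm, which the page describes only as machine learning that adapts to usage patterns. The record layout, the charge-type labels, the same-weekday median baseline, and the `min_impact` and `ratio` thresholds are all assumptions chosen for illustration.

```python
from statistics import median

# Hypothetical labels for the non-usage charges the page says are excluded
# (taxes, credits, refunds, Reserved Instance fees).
EXCLUDED_CHARGE_TYPES = {"Tax", "Credit", "Refund", "RIFee"}


def usage_only(records):
    """Keep only usage charges from a list of cost records (dicts)."""
    return [r for r in records if r["charge_type"] not in EXCLUDED_CHARGE_TYPES]


def is_anomalous(history, weekday, actual, min_impact=50.0, ratio=1.5):
    """Compare a day's cost to the median cost of the same weekday.

    history: list of (weekday, cost) pairs for prior days.
    min_impact and ratio are illustrative thresholds, not Datadog values.
    """
    same_day = [cost for day, cost in history if day == weekday]
    if not same_day:
        return False  # no baseline to compare against
    expected = median(same_day)
    # Deviations with a small cost impact are dropped to reduce noise.
    if abs(actual - expected) < min_impact:
        return False
    return actual > expected * ratio or actual < expected / ratio
```

With a history where Mondays regularly cost more than other days, a high Monday is compared only against earlier Mondays, so a recurring weekly spike is not flagged.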
33+
34+
## View cost anomalies
2135

22-
To further reduce noise, anomalies with low cost impact are automatically filtered out, ensuring your attention stays focused on the largest anomalies.
36+
On the [Anomalies tab of the Cloud Cost page in Datadog][1], you can view and filter anomalies:
2337

24-
[1]: https://app.datadoghq.com/cost/analyze/anomalies
38+
- **Active**: Anomalies from the last full day of cost data (typically 2-3 days prior).
39+
- **Past**: Anomalies that lasted more than 7 days or are no longer detected as anomalous. Past anomalies can be useful to report on, but are often less urgent and actionable.
40+
- **Resolved**: Anomalies that you've marked as resolved with context.
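
The three states can be read as a small decision rule, sketched below in Python. The function name, its parameters, and the inclusive day count are hypothetical. The sketch restates the definitions in the list above and does not describe how Datadog stores or computes anomaly state.

```python
from datetime import date


def anomaly_status(first_anomalous_day, last_anomalous_day,
                   last_full_data_day, resolved=False):
    """Classify an anomaly as Active, Past, or Resolved.

    last_full_data_day is the most recent day with complete cost data,
    which the page notes is typically 2-3 days before today.
    """
    if resolved:
        return "Resolved"  # a user marked it as resolved
    # Assumption: duration is counted inclusively in whole days.
    duration_days = (last_anomalous_day - first_anomalous_day).days + 1
    if duration_days > 7:
        return "Past"  # lasted more than 7 days
    if last_anomalous_day < last_full_data_day:
        return "Past"  # no longer detected as anomalous
    return "Active"
```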
2541

26-
## View cost anomalies
42+
Each anomaly card shows:
43+
- Service name (`rds`, for example)
44+
- Usage type
45+
- Cloud accounts affected
46+
- Expected vs. actual costs
47+
- Cost trend graph (past 1 month)
48+
49+
Anomalies are sorted by cost impact, with the most significant changes at the top.
2750

28-
On the [Anomalies tab of the Cloud Cost page in Datadog][1], you can view the anomalies and filter them to Active, Past, or Resolved:
29-
- **Active**: An anomaly is **Active** if anomalous costs occurred in the last day (the last full day of cost data, which is often 2-3 days prior).
30-
- **Past**: If an anomaly lasts more than 7 days, or the algorithm detects that costs are no longer anomalous, the anomaly is moved to the **Past** tab. Past anomalies can be useful to report on, but are often less urgent and actionable.
31-
- **Resolved**: If you [resolve an anomaly](#resolve-anomalies), the anomaly can be found in the **Resolved** tab.
51+
## Investigate anomalies
3252

33-
Each anomaly explains how much more costs were than expected for the service name (ex:'rds'), usage type, and cloud accounts. Additionally, the anomaly also shows what expected costs would have been in the time frame. The anomaly card shows a graph with the cost trend over the past 1 month.
53+
### Understand what drives anomalies
3454

35-
Anomalies with the most unexpected costs are at the top, so that it is easier to take action on anomalies with the most impact first.
55+
CCM automatically uses [Watchdog Explains][2], an investigation assistant, to help you identify what is driving cost anomalies. Watchdog Explains analyzes and identifies the specific:
3656

37-
This is an example of the list of anomalies detected in your infrastructure:
57+
- accounts
58+
- teams
59+
- services
60+
- Kubernetes or ECS clusters
61+
- regions
3862

39-
{{< img src="cloud_cost/anomalies/cost_anomalies_new_list.png" alt="List of cost anomalies automatically detected" style="width:90%;" >}}
63+
where the anomaly happened, reducing manual investigation steps. When hovering over the anomaly graph, you can see two graphs: one with and one without the tags identified by Watchdog Explains. This shows how removing specific tags flattens the spike, confirming the impact on the cost.
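
The with-and-without comparison can be sketched in Python as two daily totals computed over the same cost rows. The row layout and the `exclude_tags` parameter are assumptions for illustration. This is not the Watchdog Explains implementation.

```python
def daily_totals(rows, exclude_tags=None):
    """Sum cost per day, optionally skipping rows that carry a driver tag.

    rows: list of dicts with "day", "cost", and "tags" (a dict) keys.
    exclude_tags: tag key/value pairs identified as anomaly drivers.
    """
    exclude_tags = exclude_tags or {}
    totals = {}
    for row in rows:
        # Register every day so a fully excluded day still shows as 0.0.
        totals.setdefault(row["day"], 0.0)
        if any(row["tags"].get(k) == v for k, v in exclude_tags.items()):
            continue
        totals[row["day"]] += row["cost"]
    return totals
```

If the series computed with `exclude_tags` stays flat while the full series spikes, the excluded tags account for the anomaly.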
4064

41-
## Take action on anomalies
65+
### Take action on anomalies
4266

43-
Click an anomaly to view the services, teams, environments, and resource IDs that may be driving the cost anomaly.
67+
Follow these steps to investigate and resolve anomalies:
4468

45-
Investigate the anomaly further, and by any additional dimensions, by viewing the costs in Explorer or saving the query to a Notebook. You can also send the anomaly, Explorer link, or Notebook to the associated service owners or teams. This enables teams to provide context for why the anomaly occurred, and if it's expected.
69+
1. **Hover** over an anomaly to see anomaly drivers or click **See more** to open the side panel.
4670

47-
You can also create a cost anomaly monitor to get alerted of similar cost anomalies in the future.
71+
{{< img src="cloud_cost/anomalies/anomalies-watchdog.png" alt="Click See More to see side panel showing anomaly details, investigation options, and action buttons" style="width:80%;" >}}
4872

49-
This is the side panel where you can take action on your cost anomaly:
73+
1. **Review the details** for services affected, teams involved, environments impacted, resource IDs, or how usage and unit price may be driving the cost anomaly.
74+
1. **Investigate further**: View the anomaly in Cost Explorer or a Datadog Notebook to further investigate anomalies by using additional dimensions. You can then send the anomaly, Explorer link, or Notebook to the service owners or teams identified by Watchdog Explains. This enables teams to resolve anomalies with context for why the anomaly occurred and whether it's expected.
5075

51-
{{< img src="cloud_cost/anomalies/cost_anomalies_side-panel.png" alt="In the side panel you can take action on your cost anomaly" style="width:90%;" >}}
76+
{{< img src="cloud_cost/anomalies/anomalies-take-action.png" alt="Click Take Action to view the anomaly in Cost Explorer or add it to a Notebook" style="width:80%;" >}}
77+
1. **Set up monitoring**: Create a cost anomaly monitor for similar patterns or configure alerts for future anomalies.
78+
{{< img src="cloud_cost/anomalies/anomalies-create-monitor.png" alt="Create a cost anomaly monitor" style="width:80%;" >}}
5279

5380
## Resolve anomalies
5481

55-
As you investigate anomalies, you may find anomalies that are not significant, were actually expected costs, or are otherwise not considered an anomaly.
82+
As you investigate anomalies, you may find some that are not significant, were actually expected costs, or are otherwise not considered anomalies.
5683

57-
Mark anomalies as significant or insignificant to give feedback and help improve the anomaly detection algorithm. Resolve anomalies with context to move anomalies to the Resolved tab, and add context for others in your organization.
84+
To resolve an anomaly:
85+
86+
1. Click **Resolve Anomaly** to open the resolution popup.
87+
1. Select one of the following resolutions to help improve the algorithm:
88+
- The anomaly amount was too small
89+
- This is an unexpected increase
90+
- This is an expected increase
91+
1. **Add context** about why it is or is not an anomaly.
92+
1. Click **Resolve** to move it to the Resolved tab.
5893

5994
This is an example of how to mark a cost anomaly as significant and explain why it's an anomaly:
6095

61-
{{< img src="cloud_cost/anomalies/cost_anomalies_side-panel_is-significant.png" alt="You can mark cost anomaly as significant and write an explanation about why it's an anomaly and then resolve it" style="width:90%;" >}}
96+
{{< img src="cloud_cost/anomalies/cost_anomalies_side-panel_is-unexpected-1.png" alt="Form for marking an anomaly as unexpected with explanation field" style="width:80%;" >}}
97+
98+
## Troubleshooting
99+
100+
If you're not seeing expected anomalies:
101+
- Verify that CCM is [properly set up][3]
102+
- Check that you have the necessary permissions for AWS
103+
- Review the time range of your anomaly view
104+
105+
For more help, contact [Datadog Support][4].
106+
107+
## Further reading
108+
109+
{{< partial name="whats-next/whats-next.html" >}}
110+
111+
[1]: https://app.datadoghq.com/cost/analyze/anomalies
112+
[2]: /dashboards/graph_insights/watchdog_explains
113+
[3]: /cloud_cost_management/setup/
114+
[4]: /help/