finshed the lab #4

potatchipsxp · 2020-10-07T22:41:04Z

No description provided.

dhicks

I've identified a few problems that were preventing the script from running, then some further problems with failed tests. Fix the problems noted; use the "Source with echo" button to make sure the script runs; and then use testthat to identify further issues before you resubmit.

dhicks · 2020-10-08T22:30:39Z

lab.R

-#' 
+
+
+#I dont know how to do this


A plotly approach: Generate the static plots as above. Then call plotly::ggplotly() to generate an interactive version. In RStudio that will show up in the lower-rate panel (the same one as static plots and help files). Navigate that to identify which counties have these giant swings.

A filter approach: Look at the static plot to find a good threshold for "large" swings. Say, abs(cases_per_pop) larger than 1000. Write a filter for these large swings, then count the number of rows by county to figure out which counties.

Next, filter down to those counties, then check things like the population.

dhicks · 2020-10-08T22:37:38Z

lab.R

-# ggplot(covid_df, aes(---, ---, group = ---)) +
-#     geom_line()
+ggplot(covid_df, aes(date, cases_per_pop, group = county)) +
+    geom_line() +


With this hanging +, R tries to combine this ggplot object with the thing next (the ggplot object on line 137). It can't do this so the lab script fails before running the tests. This is why testthat::test_dir('tests') is failing.

Suggested change

geom_line() +

geom_line()

dhicks · 2020-10-08T22:40:26Z

lab.R

+         date<='2020-07-30') %>%
+  group_by(county, fips) %>% 
+  summarize(cases_per_pop=sum(cases_per_pop)) %>% 
+  ungroup() %>% 


The hanging pipe here is also causing the script to fail when it's run.

Suggested change

ungroup() %>%

ungroup()

dhicks · 2020-10-08T22:44:55Z

lab.R

+  date>='2020-06-01',
+  date<='2020-06-30') %>%
+  group_by(county, fips) %>% 
+  summarize(parks=mean(pct_diff, rm.na =TRUE)) %>% 


Typo. So mean() is not ignoring NAs and you end up filtering out more counties than intended.

Suggested change

summarize(parks=mean(pct_diff, rm.na =TRUE)) %>%

summarize(parks=mean(pct_diff, na.rm =TRUE)) %>%

dhicks

Looks like the only remaining issue is the filter in cases_july

dhicks · 2020-10-14T16:08:45Z

lab.R


+cases_july = covid_df %>%
+  filter(date>='2020-07-01',
+         date<='2020-07-30') %>%


Double check the number of days in July

Suggested change

date<='2020-07-30') %>%

date<='2020-07-31') %>%

finshed the lab

ec5c870

dhicks requested changes Oct 8, 2020

View reviewed changes

fixing lab

0d30539

dhicks requested changes Oct 14, 2020

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

finshed the lab #4

finshed the lab #4

Uh oh!

potatchipsxp commented Oct 7, 2020

Uh oh!

dhicks left a comment

Uh oh!

dhicks Oct 8, 2020

Uh oh!

dhicks Oct 8, 2020

Uh oh!

dhicks Oct 8, 2020

Uh oh!

dhicks Oct 8, 2020

Uh oh!

dhicks left a comment

Uh oh!

dhicks Oct 14, 2020

Uh oh!

Uh oh!

	summarize(parks=mean(pct_diff, rm.na =TRUE)) %>%
	summarize(parks=mean(pct_diff, na.rm =TRUE)) %>%

finshed the lab #4

Are you sure you want to change the base?

finshed the lab #4

Uh oh!

Conversation

potatchipsxp commented Oct 7, 2020

Uh oh!

dhicks left a comment

Choose a reason for hiding this comment

Uh oh!

dhicks Oct 8, 2020

Choose a reason for hiding this comment

Uh oh!

dhicks Oct 8, 2020

Choose a reason for hiding this comment

Uh oh!

dhicks Oct 8, 2020

Choose a reason for hiding this comment

Uh oh!

dhicks Oct 8, 2020

Choose a reason for hiding this comment

Uh oh!

dhicks left a comment

Choose a reason for hiding this comment

Uh oh!

dhicks Oct 14, 2020

Choose a reason for hiding this comment

Uh oh!

Uh oh!