
Covid-19 spread in United States
PROBLEM
The COVID-19 pandemic has had a profound impact on the United States, posing significant challenges in controlling the spread of the virus. Despite various mitigation measures and vaccination campaigns, the virus continues to spread in some regions, leading to clusters of cases and potential outbreaks. Identifying the factors contributing to the continued spread and developing effective strategies to mitigate it are critical public health objectives.
GOAL
The primary goal of this case study is to investigate the factors contributing to the ongoing spread of COVID-19 in the United States and propose strategies for mitigating the spread.

Skills Used
Data cleaning
​
Data grouping and summarizing
​
Descriptive analysis
​
Exploratory analysis
​
Regression analysis
​
Developing insights
​
Visualization
​
Storytelling

Tools Used
Language:Python
​
Python libraries used:
Pandas - Data Analysis
Tableau – Visualization
​
Software:
​
Microsoft Excel
​
Microsoft Powerpoint
​
Jupyter Notebooks

Data Source
The data is provided by the Johns Hopkins University through their excellent github repo.Data is downloaded from Kaggle link click here.
Covid-19 Insights
Is there a correlation between population vs covid cases?

Hypothesis:
As the population increases,covid cases also increases.
-
To test the hypothesis linear regression is conducted. Most of the datapoints are close to the upward regression line.
​​
-
The R-squared value is 0.97 which indicate strong relationship between population and covid cases.
Is there a correlation between population vs covid cases
Hypothesis:
As the cases increases,covid deaths also increases.
​
-
To test the hypothesis linear regression is conducted. Most of the datapoints are close to the upward regression line.
-
The R-squared value is 0.95 which indicate strong relationship between population and covid cases.

Why are some states affected heavily by covid?

-
California,Newyork,Georgia,Texas has the highest number of covid cases and deaths.
​​
-
The states like Rhode Island, Delaware, District of Columbia and the Hawaii have least number of covid deaths and cases.
-
The number of COVID cases and COVID deaths were recorded high in the states of California,Texas,NewYork and Georgia.
​​
-
At the same time states such as Rhode Island, Delaware, District of Columbia and the Hawaii had least number of COVID cases and COVID deaths.

​From above two graphs, it is inferred that COVID cases and COVID deaths in each states are directly proportional to their population

-
The states like California,Georgia,Texas,Newyork need to given more priority allocating health workers.
​
-
The low populated states like Rhode Island, Delaware, District of Columbia and the Hawaii need to given less priority allocating health workers.
