1
00:00:00,470 --> 00:00:00,820
Data

2
00:00:00,820 --> 00:00:04,450
scientists follow a certain process to understand the world,

3
00:00:04,450 --> 00:00:07,310
and it's called the Data Science Life Cycle.

4
00:00:07,310 --> 00:00:09,623
In this course, you'll learn more about this

5
00:00:09,623 --> 00:00:11,829
process and what each stage of it entails.

6
00:00:11,829 --> 00:00:14,494
But for now, I'll give you a quick preview.

7
00:00:14,494 --> 00:00:17,892
The first stage of the data lifecycle is to formulate a

8
00:00:17,892 --> 00:00:20,912
question you have or a problem you want to solve.

9
00:00:20,912 --> 00:00:24,547
Next, you acquire and clean data that is relevant to your

10
00:00:24,547 --> 00:00:25,845
question or problem.

11
00:00:25,845 --> 00:00:29,402
Third, you conduct exploratory data analysis.

12
00:00:29,402 --> 00:00:33,320
Fourth, you would use prediction and inference to draw conclusions

13
00:00:33,320 --> 00:00:34,164
from the data.

14
00:00:34,164 --> 00:00:37,491
Now, it's common for you to discover more questions you have

15
00:00:37,491 --> 00:00:40,423
or problems you need to solve after the fourth stage.

16
00:00:40,423 --> 00:00:42,967
So you would go through the process repeatedly,

17
00:00:42,967 --> 00:00:45,935
and that's why there's a positive feedback loop.

18
00:00:45,935 --> 00:00:49,462
The data science lifecycle is critical to how data scientists

19
00:00:49,462 --> 00:00:50,638
approach their work,

20
00:00:50,638 --> 00:00:54,000
and now you know the major stages of this process.


