Member-only story

The Life Cycle of a Data Project

LZP Data Science
2 min readOct 30, 2019

--

Ever wondered how does a life cycle for a data project begin and end? Hint, its more of an ongoing process as In real-world scenarios, many of the tasks can be performed in a different order. The life cycle of a data project is in flux, meaning that steps could be backtracked or repeated depending on the results obtained.

Define the Business Problem

Easily considered the most critical step, in which the analyst identifies the business need and problems faced. This allows the analyst to consider the relevant data to be utilised for the business problem.

Studying and Cleaning of Data

Before running any algorithms or building models, the analyst should take the time to explore the dataset for patterns or what they can decipher from it. Afterwhich, cleaning of the data is essential if it contains issues such as missing data, extreme values or outliers. As some algorithms, e.g. the K-means clustering algorithm is susceptible to such problems, by cleaning the data, it allows a reduction on the adverse effects or results on the analysis performed. Data normalisation can also be applied to scale the data of an attribute into a smaller range.

Identify Data Analysis Techniques or Algorithms to be used

--

--

No responses yet