Showing posts from January, 2021

2021 - the year of conversion.

 Data science, and understanding ubiquitous data streams, is clearly a hot technology topic.  The foundation of these exercises is having clean factored data (time, location, source, and all relevant attributes).  Often these data streams are in different formats and have different factors, or these rough streams require data mining to capture and catalog. How many of us have received emails and snail mail generated from dirty data sources?  I personally enjoy seeing these errors in my mail pieces and searching for clues in the defects presented. Do you have text and data conversion projects planned in 2021?  Do you have plans to verify and validate clean data?