The process of converting unstructured text data into meaningful data for analysis.
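As an illustration only, the following minimal sketch shows one simple form such a conversion can take: turning free-form sentences into per-document word counts that can then be analyzed. The function name `term_frequencies`, the tokenization rule, and the sample sentences are assumptions made for this example and do not come from the source.

```python
import re
from collections import Counter


def term_frequencies(documents):
    """Convert raw text strings into one word-count table per document."""
    rows = []
    for text in documents:
        # Lowercase the text and treat runs of letters/apostrophes as tokens.
        tokens = re.findall(r"[a-z']+", text.lower())
        # Counter maps each token to the number of times it occurs.
        rows.append(Counter(tokens))
    return rows


# Hypothetical sample input: two short pieces of unstructured text.
docs = [
    "The delivery was late, and the box was damaged.",
    "Great service. The delivery was fast!",
]

counts = term_frequencies(docs)
for i, row in enumerate(counts):
    # Show the three most frequent tokens in each document.
    print(i, row.most_common(3))
```

Each resulting `Counter` is structured data: it can be compared across documents, filtered, or fed into further analysis, which the original free text cannot be directly.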