Objectives covered by the block (8 exam items)
Data Collection
- Objective 1a.1 – Assess candidates’ understanding of data collection methods, appropriate usage, and limitations
- Objective 1a.2 – Test candidates’ ability to select and implement appropriate data collection techniques based on project requirements
- Objective 1a.3 – Evaluate candidates’ knowledge of sourcing data from various databases and APIs
- Objective 1a.4 – Evaluate candidates’ knowledge of using pandas for basic data loading, manipulation, and summarization
- Objective 1a.5 – Test candidates’ proficiency in reshaping DataFrames to meet analysis goals
Data Cleaning
- Objective 1b.1 – Assess candidates’ ability to identify what constitutes bad data
- Objective 1b.2 – Evaluate candidates’ ability to identify and rectify common data quality issues such as missing data, inconsistent data, or anomalies
- Objective 1b.3 – Test candidates’ proficiency using programming languages (like Python) to clean and pre-process data
- Objective 1b.4 – Assess the candidates’ ability to implement strategies for handling outliers and missing data
- Objective 1b.5 – Evaluate candidates’ understanding and application of data cleaning using Python libraries
Data Validation (text, numeric, audio, video, & social data)
- Objective 1c.1 – Evaluate candidates’ understanding of data validation techniques, including data type validation, range validation, and cross-reference validation
- Objective 1c.2 – Test candidates’ ability to validate data from various formats (text, numeric, audio, video, & social data)
- Objective 1c.3 – Evaluate candidates’ ability to ensure the reliability and accuracy of the collected data
- Objective 1c.4 – Assess candidates’ ability to implement robust data validation procedures to ensure the reliability and accuracy of the collected data
- Objective 1c.5 – Test candidates’ knowledge of consolidating and validating data from multiple datasets and formats