Data Exploration Skill
When exploring a dataset, follow this structured approach. Adapt based on whether the data is tabular (CSV/DataFrame), image-based, or text.
1. Schema and Structure
First, understand what you're working with:
- Load a sample (first 5 rows + last 5 rows)
- Column names, dtypes, and count of non-null values
- Dataset dimensions (rows x columns)
- Memory usage
- Identify the target variable if this is a supervised learning task
For tabular data:
df.info()
[Description truncada. Veja o README completo no GitHub.]