Python Idle
Real datasets are messy. Can you create one with specific statistical properties? This quest flips the script: instead of analyzing data someone else collected, you'll engineer a dataset that meets precise specifications. Control the mean, inject correlations, mix in categories, and sprinkle missing values — all by design.
~15 minsandbox lab
Loading Python runtime...
Goals: 6 tests
should create a DataFrame with shape (200, 4)
Column A should have mean between 45 and 55
Column A should have std between 8 and 12
Columns A and B should be correlated (r > 0.6)
Column C should have exactly 3 unique categories
Column D should have approximately 20 missing values
Python loading...