tezvyn:

How would you handle 10% nulls in a key column?

Source: scikit-learn.orgbeginner

This tests your ability to diagnose data issues before solving. First, investigate why data is null. Then, discuss trade-offs of dropping vs. imputing with the mean or median, considering the impact on the dashboard's accuracy.

This tests your ability to diagnose data integrity issues before solving and articulate statistical trade-offs. A great answer starts by investigating *why* 10% of the data is null. Then, it weighs the pros and cons of key options: dropping rows (data loss), imputing with the mean (reduces variance), or using the median (more robust). The final choice should be justified by the data's distribution and the dashboard's goal. A red flag is immediately picking a method without this diagnostic step.

Read the original → scikit-learn.org

Get five bites like this every day.

Tezvyn delivers a daily feed of 60-second tech bites with quizzes to lock in what you learn.

How would you handle 10% nulls in a key column? · Tezvyn