Data Analysis
A subtle default that many new analysts don't know about is that Jupyter Notebook displays only 20 columns of a pandas DataFrame. When a dataset has more than 20 columns, the display is truncated right in the middle, and the hidden columns are replaced by an ellipsis (...).
If you pick up the quick tip I'm about to demonstrate early in your learning journey, you'll be a little rockstar. I'll be using the Ames housing data, which has 2930 rows and 82 columns.
By default, this is what you get:
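Here is a rough sketch of that default view. It assumes a synthetic stand-in for the Ames data with the same shape (the column names col_0 to col_81 are made up), and it pins the limits to the notebook defaults so the result looks the same outside Jupyter.

```python
import pandas as pd

# Synthetic stand-in with the same shape as the Ames data (2930 rows, 82 columns).
# The column names col_0 ... col_81 are made up for illustration.
df = pd.DataFrame(0, index=range(2930), columns=[f"col_{i}" for i in range(82)])

# Pin the limits to the notebook defaults (20 columns, 60 rows) for this block only.
with pd.option_context("display.max_columns", 20, "display.max_rows", 60):
    default_view = repr(df)

# The middle columns are replaced by a "..." column.
print(default_view)
```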
To show all columns, simply call pd.set_option('display.max_columns', None) after the read_csv command.
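A minimal sketch of the call, again assuming a made-up DataFrame of the same shape in place of the one you would load with read_csv:

```python
import pandas as pd

# Stand-in for the DataFrame returned by read_csv (hypothetical column names).
df = pd.DataFrame(0, index=range(2930), columns=[f"col_{i}" for i in range(82)])

# None removes the column limit, so every column is displayed.
pd.set_option("display.max_columns", None)

# In a notebook you would just put df on the last line of the cell.
all_columns_view = repr(df)
print(all_columns_view)
```

The setting is global, so it applies to every DataFrame displayed afterwards in the same session.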
This works, but the rows are still truncated.
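In rare cases, you may want to view all the rows as well. Simply change 'columns' to 'rows' in the option name (display.max_rows instead of display.max_columns) and call the function just as you did earlier.
To restore the default number of rows and columns shown, change 'set_option' to 'reset_option' and drop the value, since reset_option takes only the option name.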
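As a sketch, here is the same idea on a smaller made-up DataFrame (200 rows, with hypothetical row and column labels), since printing all 2930 rows makes for a very long output:

```python
import pandas as pd

# Small hypothetical frame: 200 rows is already more than the default row limit.
small_df = pd.DataFrame(
    0,
    index=[f"row_{i}" for i in range(200)],
    columns=["a", "b", "c"],
)

# None removes the row limit, so every row is displayed.
pd.set_option("display.max_rows", None)

all_rows_view = repr(small_df)
print(all_rows_view)
```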
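For example, a short sketch that removes both limits and then puts them back:

```python
import pandas as pd

# Remove both limits.
pd.set_option("display.max_columns", None)
pd.set_option("display.max_rows", None)

# reset_option takes only the option name, no value.
pd.reset_option("display.max_columns")
pd.reset_option("display.max_rows")

# Back to the pandas default row limit.
restored_rows = pd.get_option("display.max_rows")
print(restored_rows)
```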
The best approach is simply to show all columns and specify the number of rows to be shown (usually between 5 and 10).
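One way to do that, sketched here with head() as my assumed way of picking the rows and a made-up DataFrame standing in for the Ames data:

```python
import pandas as pd

# Stand-in for the DataFrame returned by read_csv (hypothetical column names).
df = pd.DataFrame(0, index=range(2930), columns=[f"col_{i}" for i in range(82)])

# Show every column.
pd.set_option("display.max_columns", None)

# Keep only the first 5 rows for display; any number from 5 to 10 works well.
preview = df.head(5)
print(preview)
```

This keeps the output short while still letting you see every column name.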