In the first week, Kevin introduces us to some starter code to help develop the tools needed to start looking at large data sets.
Here is the link to the code he was working on and that you will be working on this week: Google Collab Page
Week 2 – More BAsics
In week 2, Kevin introduces us to some more basics of coding including plotting in this very short video.
WEEK 3 – TITANIC DATA
This week Kevin is going to jump into working with actual data so we can start visualizing and organizing data.
>
bins = np.arange(0, df_dropna['age'].max() + 10, 10)
plt.figure(figsize=(10, 6))
# Plotting stacked histogram
plt.hist([age_died, age_survived],
bins=bins, alpha=0.5, label=['Did Not Survive', 'Survived'], stacked=True)
plt.xlabel('Age')
plt.ylabel('Number of Passengers')
plt.title('Stacked Histogram of Age vs. Survivability')
plt.legend()
plt.grid(True)
plt.show()
age_died = df_dropna[df_dropna['survived'] == 0]['age']
age_survived = df_dropna[df_dropna['survived'] == 1]['age']