Computer Science – Data SCIENCE

In the first week, Kevin introduces us to some starter code to help develop the tools needed to start looking at large data sets.

Here is the link to the code he was working on and that you will be working on this week: Google Collab Page

Week 2 – More BAsics

In week 2, Kevin introduces us to some more basics of coding including plotting in this very short video.

WEEK 3 – TITANIC DATA

This week Kevin is going to jump into working with actual data so we can start visualizing and organizing data.

Here is the link to the code for this week – Google Collab 3

>

bins = np.arange(0, df_dropna['age'].max() + 10, 10)

plt.figure(figsize=(10, 6))

# Plotting stacked histogram
plt.hist([age_died, age_survived],
bins=bins, alpha=0.5, label=['Did Not Survive', 'Survived'], stacked=True)

plt.xlabel('Age')
plt.ylabel('Number of Passengers')
plt.title('Stacked Histogram of Age vs. Survivability')
plt.legend()
plt.grid(True)
plt.show()
age_died = df_dropna[df_dropna['survived'] == 0]['age']
age_survived = df_dropna[df_dropna['survived'] == 1]['age']