Kaggle Competition – Creating a Titanic Model in R

The Kaggle competition for the Titanic data set asks you to build a model from the Titanic data and submit its predictions. We will show you how to get started using RStudio.

You can watch Part Two of this series here.

Check out the data set we use here: Titanic Data Set
Download RStudio here: Download RStudio


About The Author
- Data Science Dojo is a paradigm shift in data science learning. We enable all professionals (and students) to extract actionable insights from data.

1 Comment

    I’m getting an error when trying to create the titanic.model. I followed all prior steps, so I’m unsure where I went wrong!

    This is the error:
    titanic.model <- randomForest(formula = survived.formula, data = train, ntree = 500, mtry = 3, nodesize = 0.01 * nrow(train))
    Error in na.fail.default(list(Survived = c(1L, 2L, 2L, 2L, 1L, 1L, 1L, :
    missing values in object
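
The error quoted above comes from randomForest's default NA handling (na.action = na.fail), which stops as soon as any variable used in the formula contains an NA. In the Kaggle training file the Age column, for example, has missing entries. The sketch below is only an illustration, not the steps from the video: it uses a small made-up data frame (the values and the choice of columns are assumptions) to show how one might count the missing values per column and fill numeric gaps with the column median before fitting.

```r
# Toy stand-in for the Titanic training data.
# The values are made up for illustration; only the column names mirror the Kaggle file.
train <- data.frame(
  Survived = factor(c(0, 1, 1, 0, 1, 0)),
  Pclass   = c(3, 1, 3, 1, 2, 3),
  Age      = c(22, 38, NA, 35, NA, 54),
  Fare     = c(7.25, 71.28, 7.92, NA, 13.00, 51.86)
)

# Count missing values per column to see which variables would trigger na.fail.
na.counts <- colSums(is.na(train))
print(na.counts)

# Replace missing numeric values with the column median.
# Median imputation is one simple option, assumed here for illustration.
for (col in names(train)) {
  if (is.numeric(train[[col]]) && anyNA(train[[col]])) {
    train[[col]][is.na(train[[col]])] <- median(train[[col]], na.rm = TRUE)
  }
}

# No NAs remain, so a model call on this data would no longer stop in na.fail.
stopifnot(!anyNA(train))
```

Running the same colSums(is.na(...)) check on your own train data frame should show which columns need attention. As an alternative to imputing by hand, the randomForest package also provides na.roughfix, which can be passed as na.action = na.roughfix in the model call.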

