How to do the Titanic Kaggle competition in R – Part 1

As part of submitting to Data Science Dojo’s Kaggle competition you need to create a model out of the titanic data set. We will show you how to do this using RStudio.

Titanic Data Set:
https://www.kaggle.com/c/titanic

Download RStudio:
https://www.rstudio.com/products/rstudio/download/

(777)

About The Author
- Data Science Dojo is a paradigm shift in data science learning. We enable all professionals (and students) to extract actionable insights from data.

1 Comment

  • TG
    Reply

    Hi,
    I’m getting an error when trying to create the titanic.model. I followed all prior steps, so i’m unsure where I went wrong!

    This is the error:
    titanic.model <- randomForest(formula = survived.formula, data = train, ntree = 500, mtry = 3, nodesize = 0.01 * nrow(train))
    Error in na.fail.default(list(Survived = c(1L, 2L, 2L, 2L, 1L, 1L, 1L, :
    missing values in object

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>