References

Last updated 7 years ago

  • Learning Spark: Lightning-Fast Big Data Analysis - Holden Karau, Andy Konwinski, Patrick Wendell & Matei Zaharia

  • Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
  • Azure Spark docs
  • Spark RDD Part 1 & 2