Home
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AI Tutorial

getting-started-with-pyspark

Public

In this tutorial, I explained SparkContext by using map and filter methods with Lambda functions in Python and created RDD from object and external files, transformations and actions on RDD and pair RDD, PySpark DataFrame from RDD and external files, used sql queries with DataFrames by using Spark SQL, used machine learning with PySpark MLlib.

Creat2019-12-30T02:54:50
Update2024-04-20T02:47:41
https://developer.ibm.com/tutorials/getting-started-with-pyspark/
5
Stars
0
Stars Increase