load module

load.py

This Python module contains functions for loading transformed data into a database.

load.load_data(spark: SparkSession, df: DataFrame, db_file: str = 'db.parquet')

Collect the data locally and write it to a Parquet file.

Parameters
  • spark – Spark session used.
  • df – DataFrame to store.
  • db_file – Name of the Parquet file to write (defaults to 'db.parquet').

Returns

None
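
Example

The following is a minimal sketch of how a function with this signature might behave, not the module's actual implementation. It assumes that "collect data locally" means converting the Spark DataFrame to a pandas DataFrame on the driver with toPandas(), and that the file is then written with pandas' to_parquet() (which needs pyarrow or fastparquet installed). The name load_data_sketch is hypothetical, and the spark argument is accepted only to mirror the documented signature.

```python
from __future__ import annotations

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Imported for type hints only, so the sketch can be read without PySpark installed.
    from pyspark.sql import DataFrame, SparkSession


def load_data_sketch(
    spark: SparkSession, df: DataFrame, db_file: str = "db.parquet"
) -> None:
    """Hypothetical stand-in for load.load_data (assumed behavior)."""
    # Assumption: "collect locally" is done via toPandas(), which pulls
    # every row to the driver. This is only safe for data that fits in memory.
    local_df = df.toPandas()

    # Assumption: the local copy is written with pandas; index=False keeps
    # the pandas row index out of the Parquet file.
    local_df.to_parquet(db_file, index=False)
```

With a live session, a call would look like load_data_sketch(spark, df, 'out.parquet'). For data too large to collect on the driver, writing directly from Spark with df.write.parquet(...) is the usual alternative, though it produces a directory of part files instead of a single file.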