pyspark.sql.streaming.DataStreamWriter.option
DataStreamWriter.option(key, value)
Adds an output option for the underlying data source.
New in version 2.0.0.
Changed in version 3.5.0: Supports Spark Connect.
Notes
This API is evolving.
Examples
>>> df = spark.readStream.format("rate").load()
>>> df.writeStream.option("x", 1)
<...streaming.readwriter.DataStreamWriter object ...>
The example below sets the 'numRows' option on the console sink so that 3 rows are printed for every batch.
>>> import time
>>> q = spark.readStream.format(
...     "rate").option("rowsPerSecond", 10).load().writeStream.format(
...     "console").option("numRows", 3).start()
>>> time.sleep(3)
>>> q.stop()
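Because each call to option() returns a writer, several options can be applied in a loop instead of a long chain. The sketch below is illustrative only: apply_options and RecordingWriter are hypothetical names that are not part of PySpark, and RecordingWriter is a stand-in that merely records what it receives so the snippet runs without a Spark session.

```python
from typing import Any, Mapping


def apply_options(writer: Any, options: Mapping[str, Any]) -> Any:
    """Call writer.option(key, value) once per entry and return the final writer."""
    for key, value in options.items():
        # option() returns a writer, so reassigning keeps the chain going.
        writer = writer.option(key, value)
    return writer


class RecordingWriter:
    """Hypothetical stand-in for DataStreamWriter; it only records options."""

    def __init__(self) -> None:
        self.recorded: dict = {}

    def option(self, key: str, value: Any) -> "RecordingWriter":
        # Store the option and return self, mirroring the chaining style.
        self.recorded[key] = value
        return self


demo = apply_options(RecordingWriter(), {"numRows": 3, "truncate": False})
print(demo.recorded)
```

With a real stream and an active SparkSession, the same helper could presumably be given df.writeStream.format("console") in place of the stand-in, followed by start(). DataStreamWriter.options(**options) serves a similar purpose when the option names are known up front.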