Executor for Apache Spark #499

Open

Description

@rbavery

Could Spark be added as a supported executor?

Perhaps RDD.map or RDD.mapPartitions would be the right way to map a function over tasks, similar to map_unordered in the Lithops executor.

https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.RDD.mapPartitions.html#pyspark.RDD.mapPartitions
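A rough sketch of what this could look like, assuming a hypothetical SparkExecutor class whose map_unordered method is modeled on the Lithops executor's interface (the class name, method signature, and input format are all assumptions, not an existing API in this project):

```python
from pyspark.sql import SparkSession


class SparkExecutor:
    """Hypothetical Spark-backed executor (sketch, not an existing API)."""

    def __init__(self, spark: SparkSession):
        self.sc = spark.sparkContext

    def map_unordered(self, func, inputs, **kwargs):
        # Apply func to every item in a partition; partitions run in
        # parallel across the cluster.
        def run_partition(items):
            for item in items:
                yield func(item, **kwargs)

        rdd = self.sc.parallelize(list(inputs))
        # toLocalIterator() streams results back one partition at a time.
        # map_unordered doesn't promise any ordering, so returning results
        # in partition order is acceptable here.
        yield from rdd.mapPartitions(run_partition).toLocalIterator()


# Example usage:
# spark = SparkSession.builder.getOrCreate()
# executor = SparkExecutor(spark)
# for result in executor.map_unordered(lambda x: x * 2, range(8)):
#     print(result)
```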

To support this, a guess would need to be made up front about the memory reserved for Python UDFs. It sounds like this would currently be done globally, but perhaps it could later be done on a per-operator basis?
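A global reservation could be expressed through standard Spark configuration when the session is created. A minimal sketch (both properties are real Spark settings; the values and app name are placeholders, and per-operator limits would need a mechanism beyond these cluster-wide settings):

```python
from pyspark.sql import SparkSession

# spark.executor.pyspark.memory caps the memory of each executor's Python
# worker processes, which is where an up-front reservation for Python UDFs
# would be expressed. spark.executor.memoryOverhead adds off-heap headroom
# per executor for non-JVM usage.
spark = (
    SparkSession.builder
    .appName("spark-executor-demo")  # hypothetical app name
    .config("spark.executor.pyspark.memory", "4g")
    .config("spark.executor.memoryOverhead", "2g")
    .getOrCreate()
)
```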
