How to run PySpark with 3rd party Jars e.g. fuzzywuzzy? -
tried --jars option , --driver-class-jars etc. still gave me 'no module fuzzywuzzy' found error.
try pyspark --packages me.xdrop:fuzzywuzzy:1.1.8
also have @ https://stackoverflow.com/a/44153456/3811916 other options, depending on desired workflow/environment.
Comments
Post a Comment