Modin (formerly Pandas on Ray)

Has anyone worked with Modin? will you share your exp with this library?

Modin is a DataFrame library that allows you to speed up your panda’s workflows by changing one line of code. We were motivated to create this tool by an interesting trend we observed. We noticed that many Data Scientists are often forced to use different tools to perform the same tasks at different data scales. Not only are these tools completely different, they expose new APIs and sometimes require user input for system information (e.g. partitioning). When we started Modin, we took the approach that Modin should be a DataFrame for datasets ranging from 1KB to 1TB+, without needing to specify partitioning. We believe Data Scientists should be spending their time extracting value from their data in the DataFrame API that they use the most: pandas.

I got it installed, but the import fails:

TypeError Traceback (most recent call last)
in ()
----> 1 import modin.pandas as pd

~/anaconda3/lib/python3.7/site-packages/modin/pandas/ in ()
78 include_webui=False,
79 redirect_worker_output=True,
—> 80 use_raylet=True,
81 )
82 except AssertionError:

TypeError: init() got an unexpected keyword argument ‘use_raylet’