Hello @jeremy, I know this question has already been asked here, but I wanted to go into more detail on how to contribute to the library. More specifically:
- In what manner are we allowed to modify existing code? For instance, let's say I want to change this function signature to something like:
import re

def add_datepart(df, fldname, inplace=False):
    """
    Some documentation
    """
    # Work on a copy unless the caller explicitly asks for in-place changes
    if not inplace:
        df = df.copy()
    fld = df[fldname]
    targ_pre = re.sub('[Dd]ate$', '', fldname)
    for n in ('Year', 'Month', 'Week', 'Day', 'Dayofweek', 'Dayofyear',
              'Is_month_end', 'Is_month_start', 'Is_quarter_end',
              'Is_quarter_start', 'Is_year_end', 'Is_year_start'):
        df[targ_pre+n] = getattr(fld.dt, n.lower())
    df[targ_pre+'Elapsed'] = (fld - fld.min()).dt.days
    # drop(..., inplace=True) returns None, so drop first and then return df
    df.drop(fldname, axis=1, inplace=True)
    return df
This will break existing code that uses the lib (such as the code from your first ML course), because the current implementation effectively assumes inplace is True.
(I know data science is not software engineering, but having a clear, well-defined API with some documentation always helps.)
So the question is: if I submit a pull request with such code, will it be accepted? If not, how can we make breaking changes to the API? Could we ask for a merge into a separate branch for a later version?
- Are there any rules we need to respect when submitting a pull request?
- Should we worry about code style like PEP8 and docstring style (Google) in the future? (I understood from your ML course that we don't worry too much about it for now.)
- There is no setup.py yet, so we can't version the lib or install it via pip+git. Is it planned to have one by the end of the pre-alpha stage, or can we already create a PR and submit it to you?
I know the lib is still in pre-alpha, but considering we are starting the DL course on Monday and that course uses the lib (as does your course on ML), these questions will quickly become relevant for people who want to contribute. Thanks!