Contributing to the new fastai framework

iskode · November 2, 2017, 4:39pm

Hi @jeremy ,
I’ve made a quick tour of the new Fastai library that’s a framework as you mentioned.
I would like to start making simple contributions like commenting the methods.
I think everyone who types SHIFT+TAB is expecting to see a description of the method but
at the current stage some methods are self-explanatory but for some other we must test them
before understanding what they do.
And I think with this community, it’ll go fast to have the framework documented.
What do you think about that ? or you didn’t plan it to be done for now?

jeremy · November 2, 2017, 4:44pm

I would love that This is a great thread for us to discuss any contributions to the framework, or questions about it.

iskode · November 2, 2017, 4:48pm

So Where to start?
In fact I’ve never done a pull request! I’ve just heard about that.
So should post here comments I made for a method then receiving comments and finally commit it to github?

sanjeev.b · November 2, 2017, 5:10pm

I think it would be useful to allow fastai to run on CPU too (primarily for code development using small sample data). I saw a few places where .cuda() is called. I wrapped that around if (torch.cuda.is_available()) but then ran into some other deeper problem.

Is there anything in the design that fundamentally prevents that? If not, is it worth investing in the path to get it to work on CPU?

jeremy · November 2, 2017, 7:38pm

Here’s an easy way to create a PR: GitHub - mislav/hub: A command-line tool that makes git easier to use with GitHub.

Yes definitely worth it. It shouldn’t take changes to more than a couple of lines of code - if it does, we should figure out how to simplify it!

jacquerie · November 2, 2017, 7:51pm

It’s not exactly a contribution, but I’ve been thinking that, if you ever release it independently of fast.ai, you could call it “Prometheus”, as it’s bringing PyTorch to humans.

yinterian · November 2, 2017, 8:47pm

I am happy to help with pull requests.

ramesh · November 2, 2017, 9:00pm

Hi @jeremy -
I created a Pull Request to make the code compatible with GPU and CPU. More details here - https://github.com/fastai/fastai/pull/12

Any feedback from Jeremy or anyone on this thread is greatly appreciated.

iskode · November 2, 2017, 9:19pm

This name sounds good but is a bit long.
As you might notice @jeremy likes short names, tfms(transformations), sz(size).
No fluff, just to the point and I find fastai to be very good, short and the name stands for what it is for:
start playing with your Deep Learning model with the least lines of code.

hiromi · November 2, 2017, 9:40pm

I can contribute to writing docstring as well. I think it is good to have a consistent format so that different parts of the library do not use different format though.

Any preferences, @jeremy?

Otherwise, would something like this work okay?

''' Brief description of a function

A little more detailed description of a function

:param arg1:
:param arg2:
:return:
'''

Oh! I see google style docstring in conv_learner.py. Should we use https://google.github.io/styleguide/pyguide.html?showone=Comments#Comments ?

sanjeev.b · November 2, 2017, 9:45pm

Tried it. Works well! Ship it

satya · November 2, 2017, 10:29pm

prmths?

jeremy · November 2, 2017, 10:39pm

Thanks! @yinterian has written the docstrings that are there now, so use the same approach as she has, unless you think there’s a better way, in which case please let us know what/why and we can discuss here.

jeremy · November 2, 2017, 10:40pm

That’s a really nice name. Although I think we’re going to stick with calling it ‘fastai’

hiromi · November 3, 2017, 2:20am

Sounds good! It looks like transforms.py has the most docstrings, so I will follow suit

trusttheai · November 3, 2017, 2:33pm

I have submitted https://github.com/fastai/fastai/pull/11

This pull request fixes a bug and has a version of read_dirs which is atleast 3x faster on my machine.

beecoder · November 3, 2017, 8:36pm

It’s great to have access to a working codebase. There are a few TODO’s ( @jeremy: breadcrumbs?). This could be a place to start making contributions

jeremy · November 3, 2017, 8:53pm

I think nearly all of those TODOs are from @yinterian, so we should ping her!

jeremy · November 3, 2017, 8:59pm

When making changes to the code, try to add a test that shows that the new code works, if possible. Currently, there are no tests, which is of course not how we want things long term!

yinterian · November 3, 2017, 10:12pm

One problem that we are having is that sometimes your code breaks because tmp directory is missing something. We need stronger checks there.

github.com

fastai/fastai/blob/master/fastai/conv_learner.py

from .core import *
from .layers import *
from .learner import *
from .initializers import *

model_meta = {
    resnet18:[8,6], resnet34:[8,6], resnet50:[8,6], resnet101:[8,6], resnet152:[8,6],
    vgg16:[0,22], vgg19:[0,22],
    resnext50:[8,6], resnext101:[8,6], resnext101_64:[8,6],
    wrn:[8,6], inceptionresnet_2:[-2,9], inception_4:[-1,9],
    dn121:[0,7], dn161:[0,7], dn169:[0,7], dn201:[0,7],
}
model_features = {inception_4: 3072, dn121: 2048, dn161: 4416,} # nasnetalarge: 4032*2}

class ConvnetBuilder():
    """Class representing a convolutional network.

    Arguments:
        f: a model creation function (e.g. resnet34, vgg16, etc)
        c (int): size of the last layer

This file has been truncated. show original

inside this function

def get_activations(self, force=False):

One case that breaks is the following: (1) run your code without providing a test folder (2) run your code after providing a test folder.

Let me know if this makes sense. Thank you!