Official project group thread

@Dina IIRC people have done this with language models before (in particular with ULMFiT). See here:

kheyer/Genomic-ULMFiT

sergeman/fastai-genomic

3 Likes

I would look into Recurrent Neural Networks (in particular, LSTMs) for this problem first. They are an older technique, but they are simpler to use than more modern alternatives, and they work well on this kind of problem.

Some resources:

Long Short-Term Memory networks
https://colah.github.io/posts/2015-08-Understanding-LSTMs/

This is a GREAT presentation to get an intuition, before you dive into the details: https://livefreeordichotomize.com/2017/11/08/lstm-neural-nets-as-told-by-baseball/

For your problem, since you want to predict in the middle of the sequence, you can use a bi-directional LSTM.
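A minimal PyTorch sketch of the idea, with illustrative layer sizes of my own choosing and assuming a 4-letter genomic vocabulary (A, C, G, T): a bidirectional LSTM reads the sequence in both directions, so the prediction at each position can use context from both sides.

```python
import torch
import torch.nn as nn

# Sketch (assumed sizes, not a tuned model): a bidirectional LSTM that
# predicts a class at every position of a token sequence.
class BiLSTMTagger(nn.Module):
    def __init__(self, vocab_size=4, embed_dim=16, hidden_dim=64, n_classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        # bidirectional=True gives each position context from both sides,
        # which is what you want when predicting in the middle of a sequence.
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True,
                            bidirectional=True)
        self.head = nn.Linear(2 * hidden_dim, n_classes)  # 2x: both directions

    def forward(self, x):                    # x: (batch, seq_len) token ids
        out, _ = self.lstm(self.embed(x))    # out: (batch, seq_len, 2*hidden_dim)
        return self.head(out)                # per-position class scores

model = BiLSTMTagger()
tokens = torch.randint(0, 4, (8, 100))       # batch of 8 sequences, length 100
scores = model(tokens)
print(scores.shape)                          # torch.Size([8, 100, 2])
```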

2 Likes

I'm assuming that the lack of responses means the project group sessions are done for now. Does everyone meet up in the smaller study groups? Is there a "meta-thread" with a list of the groups, or do I just dig through the forum? Thanks.

3 Likes

You could look at the Source Code Study Group

Also, the Fastbook Study Group

Note that both these study groups generally have advanced discussions.

2 Likes

Would anyone be interested in teaming up with me on the flower classification Kaggle competition https://www.kaggle.com/c/flower-classification-with-tpus as a group project? We can compare how fastai2 does with GPUs and TPUs, and we might be able to compare it to TensorFlow. We can also try techniques such as data augmentation, GANs for semi-supervised learning, and label smoothing https://towardsdatascience.com/what-is-label-smoothing-108debd7ef06
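For anyone unfamiliar with label smoothing, here's a tiny NumPy sketch of the basic idea (my own toy illustration, not fastai's implementation): the one-hot targets are softened so the model isn't pushed toward fully confident predictions, which tends to reduce overfitting.

```python
import numpy as np

# Label smoothing: mix the one-hot target with a uniform distribution.
# eps is the smoothing amount; eps=0 recovers the original one-hot labels.
def smooth_labels(one_hot, eps=0.1):
    n_classes = one_hot.shape[-1]
    return one_hot * (1 - eps) + eps / n_classes

targets = np.eye(3)[[0, 2]]          # one-hot labels for classes 0 and 2
smoothed = smooth_labels(targets, eps=0.1)
print(smoothed)                      # hot entries ~0.933, cold entries ~0.033
```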

1 Like

Looks interesting. Happy to team up. But it looks like fastai still doesn't work with TPUs yet (?). This competition seems to be geared towards TPU usage. It seems like PyTorch started supporting TPUs recently https://discuss.pytorch.org/t/pytorch-tpu-support/25504

1 Like

Best case scenario, we can test on fastai and then reproduce on TensorFlow. Google would at least appreciate feedback on where TensorFlow is falling short; for instance, it absorbed Keras to improve usability. Worst case, we can always run the fastai code on GPUs and then export the model to run on TPUs or even CPUs. Kaggle's flower dataset seems like a good one to test our knowledge from the first few classes, even if it's not quite what Google was looking for when it started this competition.

1 Like

Sounds good! Happy to collaborate. Have you started working on this already?

Thanks for the info.

1 Like

I have not started but plan to soon. I just formed a team. What is your Kaggle username, so I can invite you?

Awesome. You'll need to form a team in order to merge. I went with "Fast Team"; seems simple enough. You can merge with that team, or you can tell me your team name and I can try merging.

Hey David, I'm interested. I also want to have a deeper understanding of their approach.

1 Like

Hey Kofi, great!

I've been thinking that this might be the best approach in terms of learning:

https://blog.keras.io/how-convolutional-neural-networks-see-the-world.html

I think he provides a lot of information, some of which I don't understand, but all of which seems really interesting. For instance:

"Note that we only go up to the last convolutional layer -- we don't include fully-connected layers. The reason is that adding the fully connected layers forces you to use a fixed input size for the model (224x224, the original ImageNet format). By only keeping the convolutional modules, our model can be adapted to arbitrary input sizes."

This makes me think of many questions, including:
Wait, what? The last convolutional layer is fully connected? Last in what direction? If it's the one before the target... how is it different from all the others? In the diagrams I've seen, each layer is fully connected, which means each node in layer n gets inputs from every node in layer n-1.
"Adding the fully connected layers forces you to use a fixed input size for the model": I don't understand this at all. I believe he wrote somewhere else that in Keras, each layer is aware of how many inputs are coming in, so it 'does it automatically for you'. So that should be able to be turned on or off, I would think. And then what does it mean to not be fully connected?
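One way to see the quoted point concretely. This is a toy PyTorch model of my own (not the blog's code): a fully-connected (Linear) layer needs a fixed number of input features, and flattening a conv feature map bakes the spatial size into that number. If you keep only convolutions and use global pooling instead, the model accepts any input size.

```python
import torch
import torch.nn as nn

# Toy sketch: convolutions don't care about input size, but a Linear
# layer built on a flattened feature map would. Global average pooling
# collapses any HxW feature map to 1x1, so the Linear head always sees
# the same number of features (8 here) regardless of image size.
conv_only = nn.Sequential(
    nn.Conv2d(3, 8, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(1),   # any HxW -> 1x1
    nn.Flatten(),
    nn.Linear(8, 10),          # 8 input features, independent of image size
)

for size in (224, 96):                        # two different input sizes
    x = torch.randn(1, 3, size, size)
    print(conv_only(x).shape)                 # torch.Size([1, 10]) both times

# By contrast, flattening a 224x224 feature map would give 8*224*224
# inputs, and a Linear layer built for that count fails on 96x96 images.
```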

At any rate, how do you feel about trying to get this understood and working?

Thanks,

David

I've just retired from my work on Masks4All so I can focus full-time on fast.ai again, and I'm planning to use Discord as a regular hangout place. So please drop by and hang out if you're interested in being involved with rolling out fastai2, the course, etc. I'm hoping to do some screen sharing as well, although I haven't set up specific times yet.

Here's the Discord:

16 Likes

Nice! I will drop by for sure. BTW, I forgot the "go live" date of the 2020 course... is it still September, or did it change?

This link shows as an invalid invite in Discord.