If I now tweak the loss function to something else (complete nonsense, of course!)
def mnist_loss(predictions, targets):
predictions = predictions.sigmoid()
# mind the change of the second argument from 1-predictions to 1+predictions
return torch.where(targets==1, 1+predictions, predictions).mean()
This is in the lesson 4 notebook. This is probably a Python newbie question - definitely a pandas newbie question, but, Iām trying to fundamentally understand how pandas modifies all records in this assignment and adds a new column:
With what looks like a single concatenated string assignment, pandas has created a new column for all rows and modified the column according to the logic in the string concatenation. Iām coming to Python from other languages, so is this a pythonic thing? Or has pandas overridden object property assignment and theyāre using the expression as a shortcut to modify all rows? In another language youād just end up with df[āinputā] as a property with a single string value - or it might throw an error because e.g. df.context isnāt a variable that can be concatenated.
Iāve had a look at the pandas docs and no luck, poked around regarding python overriding assignment operators and found that it is possible to do this for object properties but didnāt immediately see that in Pandas or an explanation of how it works. Just need a pointer on what to look up to understand this behavior.
You can override operators, using what are called ādunderā methods. In this case pandas dataframe defines __add(b)__, which overrides the + operator.
At the core of the NumPy package, is the ndarray object. This encapsulates n-dimensional arrays of homogeneous data types, with many operations being performed in compiled code for performance.
NumPy has the ufunc (universal function) which is:
a function that operates on ndarrays in an element-by-element fashion, supporting array broadcasting, type casting, and several other standard features. That is, a ufunc is a āvectorizedā wrapper for a function that takes a fixed number of specific inputs and produces a fixed number of specific outputs.
As an example, here is the source code for the add method for character arrays, which returns the numpy.add function which is a ufunc. I canāt spell out exactly how this translates to the line of code you are referencing, but conceptually itās related.
Here is NumPyās description of broadcasting which comes up a lot when working with pandas (and also PyTorch). The single strings 'TEXT1: ', 'TEXT2: ' and 'TEXT3: ' are ābroadcastedā to all elements in the columns df.context, df.target and df.anchor when they are concatenated with the + operator.
I have a question regarding the MNIST deep learning model that was built in Chapter 4. As I understand it, our intent was to classify the input into on of two categories, threes and sevens, why then did we decide to use a linear model for that purpose? Personally, the first idea that came to my mind was a classification model such as logistic regression.
I attempted to run the chapter 10 notebook on Paperspace using one of the free machine configurations including GPU. It failed on the model tuning step, complaining about running out of GPU memory:
My question isnāt primarily about the advice the error text provided (although I would not turn down advice on the advice), but about the claim that PyTorch reserved 5.37 GiB of the 7.79 GiB GPU memory capacity. Is that usual, and if so, doesnāt that define a floor on GPU requirements that makes free resources not so useful?
I had this half baked idea after seeing a news piece on bitcoin. Could NLP help to identify Satoshi Nakamoto, the author of the original bitcoin whitepaper? Satoshi Nakamoto is a pseudonym, the real identify of the author remains a mystery.
Could NLP help identify the authorās real name? A couple half baked approaches:
Use the Abstract as the known, author as unknown, Journal of Cryptography up to the publication date of the paper as the data set. The test dataset is just the single bitcoin paper. This relies on the assumption that the author has published in Journal of Cryptography previous to the bitcoin paper, and abstracts being unique enough to different authors. Multiple authors on a paper would be a sticky point, treat them as one author/token in the vocabulary? Use ULMfit due to the size/number of taokens required for abstracts. Getting all of the abstracts and authors quickly and easily is another sticky spot - does the Journal of Cryptography offer an API?
Or, use the authors in References as the known, paper author as unknown, again using the J Cryptography. This has the assumption that researchers tend to reference certain other researchers more frequently. And the same challenge here of getting all authors and the References.
No such file or directory: ā/root/.fastai/data/imdb/models/finetuned.pthā
Iām not sure if this line: learn.save_encoder(āfinetunedā)
is working properly, because later in the notebook this line: learn = learn.load_encoder(āfinetunedā)
throws the error shown above.
Does anyone have an idea of what is going on, where my mistake may be?
Thanks,
Chris
Hi all, this is a silly question but Iāve not been able to resolve it on my own. How do I save/load the encoder while in Kaggle?
I am trying out the ULM fit method on a dataset using Kaggle, and Iām having issues saving and loading the finetuned encoder in Kaggle. Iām quite certain itās a directory issue due to the fact that the error shows itās trying to pull from fastaiās data location. I tried to set my model directory to kaggle/working using the below code.