Part 2 Lesson 12 wiki

fmichaelkunz · April 17, 2018, 2:55am

Why don’t we create a GAN to create ‘fake’ news in the style of a favorite politician?

Deb · April 17, 2018, 2:56am

To generate fake news you would need GAN not to identify…

yggg · April 17, 2018, 2:56am

Both nn.ConvTranspose2d and nn.Upsample seem to do the same thing, i.e. expand grid-size (height and width) from previous layer.

Can we say nn.ConvTranspose2d is always better than nn.Upsample, since nn.Upsample is merely resize and fill unknowns by zero’s or interpolation?

blakewest · April 17, 2018, 2:56am

Somebody indeed already built a fake news detector: https://towardsdatascience.com/i-trained-fake-news-detection-ai-with-95-accuracy-and-almost-went-crazy-d10589aa57c

Though really, they just built an “Associated Press writing style detector”. That’s all it does, which may or may not be that useful in practice…

rachel · April 17, 2018, 2:59am

You might also be interested in the Fake News Challenge from last year.

fmichaelkunz · April 17, 2018, 2:59am

Is it “Giff” or “Jiff”? Jeremy has answered “Giff”. Thus endeth the debate.

fmichaelkunz · April 17, 2018, 3:05am

But …" tanh" is “thann”

sgugger · April 17, 2018, 3:05am

Shouldn’t we use a sigmoid if we want values between 0 and 1?

fmichaelkunz · April 17, 2018, 3:06am

http://reference.wolfram.com/language/ref/Tanh.html tanh is a sigmoid…

keitabr · April 17, 2018, 3:09am

Can anyone recommend any papers or blog posts which apply GANs to text generation?

AdrienLE · April 17, 2018, 3:09am

Yes but when people talk about “the” sigmoid function, they usually mean the logistic sigmoid function specifically: https://en.wikipedia.org/wiki/Logistic_function

fmichaelkunz · April 17, 2018, 3:11am

@AdrienLE in traditional stats, when I was taught the stuff, they were used interchangeably or for convenience. Logistic is easier to compute than the normal, so that was default. Has bigger tails than normal (kurtosis). tanh… well…its the long lost cousin of the bunch.

keratin · April 17, 2018, 3:12am

I remember reading on reddit Ian Goodfellow said that GANs don’t work well for text. I forget why. Maybe Jeremy can confirm? @rachel

nirantk · April 17, 2018, 3:12am

Found a dataset for Fake vs Real News: https://github.com/KaiDMML/FakeNewsNet from researchers at Univ of Arizona.

KevinB · April 17, 2018, 3:14am

It looked like Jeremy uses the y of tanh as -1 to 1 and the y of sigmoid as 0 to 1.

AdrienLE · April 17, 2018, 3:19am

Is there any reason for using RMSProp specifically as the optimizer as opposed to Adam etc.?

snagpaul · April 17, 2018, 3:21am

Is there a link to EM algorithms where we train one thing and then the other?

agaldran · April 17, 2018, 3:22am

Hi!

Which could be a reasonable way of detecting overfitting while training? Or of evaluating the performance of one of these GAN models once we are done training?

In other words, how does the notion of train/val/test sets translate to GANs, and how do we handle them?

Thanks!

snagpaul · April 17, 2018, 3:23am

I have that question about unsupervised methods in general.

emilmelnikov · April 17, 2018, 3:24am

Are we supposed to get “good” bedrooms? My bedrooms are awful after 2 iterations from the notebook.