Lesson 2 - Creating your own dataset from Google Images

Thank you Johannes. Appreciate your help.

Hey @mrfabulous1,
Ha - if you showed the pics to 3 machinists they'd know straight away - I'm not one - but grew up amongst them. It's funny - there were very few classification mistakes. It seems that the system struggled to differentiate individuals but was very good at classification.
Grinders all have a grinding wheel in-line with the moving bed, and a tray with ends to catch the waste. Lathes have a horizontal axis; milling machines have a vertical axis and a tool-holder/chuck rather than a grinding disk, and no ends on the tray to catch sparks/waste. But the rest of the bits and bobs are very similar between the mills and grinders. And all cats and dogs have ears and eyes and fur, mostly :slight_smile:

It's very reassuring what you say. I feel encouraged to keep going!
Feature engineering must be the art.

Cheers, and many thanks!

– me


@mrfabulous1 - check this, I tried again with resnet50 on the filtered results. Pretty good, I think. Must have been too lo-res for all that confusing detail…

Cheers…


Hi banksiaboy, well done!

It looks like a good step, if one experiences this type of problem, to try a stronger model.

I'll keep this in my repertoire.

Cheers mrfabulous1 :smiley: :smiley:


Hi johannesstutz, hope you're having a jolly day!!

For a human maybe, but not for an AI model :laughing: :laughing:
You could also try using an even bigger resnet model.

I would definitely use more images; I always use a minimum of 200 per class for my first benchmark model.

I have added two links on feature engineering which I found quite helpful.

Have a jolly day :smiley: :smiley:


Hi and thanks mrfabulous :slight_smile:

I'll keep experimenting with different models, augmentations, resolutions, …
I have around 400 images per category, maybe Iā€™ll add some more and see if it helps.

Regarding feature extraction, thank you for the links. It makes sense to me on tabular data, but I'm not sure how to apply this in computer vision :smiley: For example, method 3 in the second link, edge detection. Isn't that something that a CNN should figure out for itself, via learnt (or pretrained) kernels?
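For reference, the hand-crafted version of that edge detection is just a fixed convolution kernel. Below is a minimal NumPy sketch (the toy image and the `conv2d` helper are my own, for illustration) of the classic Sobel operator - the kind of filter early conv layers often end up learning on their own:

```python
import numpy as np

# Sobel kernel for vertical edges: a classic hand-crafted edge detector.
# Early CNN layers routinely learn filters that look very similar, which is
# why explicit edge-detection features are usually redundant for CNNs.
SOBEL_X = np.array([[-1, 0, 1],
                    [-2, 0, 2],
                    [-1, 0, 1]], dtype=float)

def conv2d(img, kernel):
    """Naive 'valid' 2-D sliding-window filter (cross-correlation, as CNNs use)."""
    kh, kw = kernel.shape
    h, w = img.shape
    out = np.zeros((h - kh + 1, w - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = (img[i:i + kh, j:j + kw] * kernel).sum()
    return out

# A toy image with one vertical edge: left half dark, right half bright.
img = np.zeros((5, 6))
img[:, 3:] = 1.0
edges = conv2d(img, SOBEL_X)
# The response is zero in flat regions and peaks where the intensity jumps.
```

The output is zero everywhere except the two columns straddling the brightness jump - exactly the kind of activation map a learnt edge-like kernel produces.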

Hi johannesstutz, hope all is well!

The paper below is one that Jeremy uses in one of the videos. I think it's wonderful.

The paper above gives an insight into what CNNs are doing. There may indeed be a layer that is highly correlated with the characteristics of lathes, grinders, etc.

Currently, whatever one does to improve a network is a little like a dark art: unless your accuracy metric goes up or down, one really doesn't know much about which layers are being used the most. But I would postulate that if you could apply this paper to your network, then make changes and see what actually happens, the results would be highly interesting.

I have seen people choose to use a specific layer before they fine-tune.

I wonder what the inside of your model looks like.


Image from above paper.

Cheers mrfabulous1 :smiley: :smiley:


Hi,
I'm a novice. Need some help please.

I have downloaded bear images from google, saved them in specific folders (black, grizzly & teddy). Renamed the files as blackbear1.jpg, blackbear2.jpg etc in the respective folders.

When running the below code:

np.random.seed(42)
data = ImageDataBunch.from_folder(path, train=".", valid_pct=0.2, ds_tfms=get_transforms(), size=224, num_workers=4).normalize(imagenet_stats)

I am getting 2 issues (IndexError and Exception) listed below.

IndexError : index 0 is out of bounds for axis 0 with size 0

During handling of the above exception, another exception occurred:

Exception Traceback (most recent call last) in

    1 np.random.seed(4) 

----> 2 data = ImageDataBunch.from_folder(path, train =".", valid_pct=0.2, ds_tfms = get_transforms (), size=224, num_workers=4).normalize(imagenet_stats)

Exception: Can't infer the type of your targets. It's either because your data source is empty or because your labelling function raised an error.

Kindly suggest what could be the problem.
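For what it's worth, that exception usually means `from_folder` found no labelled images under `path` - for example, the class folders aren't directly below it, or the files lack image extensions. A quick stdlib-only sanity check (my own helper, not part of fastai) you can run before building the DataBunch:

```python
from pathlib import Path

def check_image_folders(path, exts=('.jpg', '.jpeg', '.png')):
    """Return {class_folder_name: image_count} for each subfolder of `path`.

    ImageDataBunch.from_folder labels each image by its parent folder name,
    so every class should appear here with a non-zero count.
    """
    counts = {}
    for sub in sorted(p for p in Path(path).iterdir() if p.is_dir()):
        counts[sub.name] = sum(1 for f in sub.iterdir()
                               if f.suffix.lower() in exts)
    return counts
```

If a class is missing, or shows 0 images, fix the folder layout (e.g. `path/black/blackbear1.jpg`) before calling `from_folder`.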

Hey harsh did you ever find a solution to this? I am having the same issue.

Hi all, finally got my first ML web app up and live and wanted to share some of my experience briefly. The app and model themselves aren't really very special, but I'll share them here.

(Resnet 34 model trained to identify if a mountain is Mt. Rainier [Washington State], Mt. Hood [Oregon], or Mt. Fuji [Japan]) - Give it a try!
https://mountainid-test.onrender.com/
https://web-app-early.wl.r.appspot.com/

For me, the model building was straightforward. I was able to work through the lesson 1 and 2 notebooks without issue to download images and train my model. The accuracy was 'ok' at 80% and I didn't really focus much on improving it beyond the steps in the lesson. I then both saved (.pth) and exported (.pkl) my model - this becomes important later.

From there, I basically followed the 'Production' instructions for the Google App Engine and I found them to be again mainly straightforward. I hit a big snag once I started trying to deploy the model though. I kept getting a massive error with tons of lines of code as it tried to run script.py. I did a bunch of debugging including:

  • Deploying on Render (didn't work)
  • Trying to run the original example script.py (that worked)
  • Checking that my model_file_url worked by typing it into my browser and seeing if it downloaded directly (it did)

I was starting to sweat a little bit and frantically search through the forums at this point. My eureka moment was when I downloaded the model for the original example from Dropbox and it came back as "stage-2.pth". That is when I realized that I was using the export .pkl instead of the .pth files.

Did I miss something in the instructions? I was under the impression that we should be using the exported model instead of the saved one.

In any case, that cleared things right up and my web app was up and running. I know this was a long post/story but Iā€™m hoping that my experience can help give someone a kickstart if needed.


I'm going to have to save this thread, because I'm exactly here and decided to take a breather to see if I know what I'm doing in lesson 2 before venturing on to making the application. Thanks for all the tips though, and I hope to make it as fancy as yours!

Andy

Hi! How did you do highlighted regions in the images of airplanes?

The code below can be used, but with some human interaction. It is somewhat slow but returns a more complete list. Also, it gets the original image rather than the version stored by Google, which may be lower res or just a thumbnail. To begin with, make sure your image search results do not exceed the minimum returned by Google by default, as the code takes a while to run.

So, given the search: 360 digger -toy -book -dog -"t-shirt" -leurre -hat -"total solutions" -its -antennas -"metal working" -compressor -drill copy and paste the lines below into the console.

Here's the search for a ride-on roller -toy -battery -skates -"roller blades" -bicycle -coaster

// set up the observer
function hrefTest(mutationList, observer) {
	mutationList.forEach( (mutation) => {
		if( mutation.attributeName === 'href' ) {
			let thisHref = mutation.target.href.split('imgurl=')[1].split('&imgrefurl')[0];
			urls.push(decodeURIComponent(thisHref));
			console.log(decodeURIComponent(thisHref));
		}
	});
}

const observerOptions = {
	childList: false,
	attributes: true,
	subtree: false
}
const observer = new MutationObserver(hrefTest)
let urls = Array();
let anchors = document.querySelectorAll('a.islib');

anchors.forEach((a) => {
	// passive must be false, otherwise preventDefault() is silently ignored
	a.addEventListener('click', (event) => { event.preventDefault(); }, {
		capture: false,
		passive: false
	});
	observer.observe(a, observerOptions);
});

Next, copy and paste the code below.

// click every image
document.querySelectorAll('a.islib').forEach((elem) => {
	elem.click();
});

The code above will take time to process. On a page result of 100 images it can take 2 minutes or more. When it's done (when all the images have finished loading), copy and paste the code below to get the final CSV.

window.open('data:text/csv;charset=utf-8,' + escape(urls.join('\n')));
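Once you have that CSV of URLs, fetching the originals is a short script. Here's a minimal stdlib-only Python sketch (the function names and the `img0000`-style file naming are my own choices, not from the thread):

```python
import urllib.request
from pathlib import Path

def read_urls(csv_path):
    """Load the exported CSV: one URL per line, blank lines skipped."""
    return [line.strip()
            for line in Path(csv_path).read_text().splitlines()
            if line.strip()]

def download_all(urls, dest_dir, prefix='img'):
    """Fetch each URL into dest_dir as img0000.jpg, img0001.png, ..."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    for i, url in enumerate(urls):
        # Derive an extension from the URL path; default to .jpg.
        suffix = Path(url.split('?', 1)[0]).suffix or '.jpg'
        target = dest / f'{prefix}{i:04d}{suffix}'
        try:
            urllib.request.urlretrieve(url, target)
        except Exception as exc:  # dead links are common in scraped lists
            print(f'skipped {url}: {exc}')
```

Expect a fair number of skipped URLs - original-image links rot quickly - so download more than you need per class.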