Bing account expired, what other option

paidion · August 21, 2020, 5:10am

For the course_v4, 2nd ipynb-- production. I was not able to get the Azure key. My free bing search api account expired. I am not able to get the key. What can I do?

mrfabulous1 · August 21, 2020, 5:40am

Hi paidion hope all is well

See this post Beginner: Following the download.ipynb for creating a dataset and other posts on downloading images using Javascript.

Create you own data sets, upload them, skip the Azure part in the notebook and proceed to this section.

dest = 'images/grizzly.jpg'
download_url(ims[0], dest)

Also Look at how the dataset and images are used in https://github.com/fastai/fastbook/blob/master/04_mnist_basics.ipynb for the 3, 7 classifier.

cheers hope this helps

mrfabulous1

paidion · August 21, 2020, 5:58am

Thanks you for your suggestions.

I tried the bing image search api a couple months ago. It’s rather easy and straight forwards. I also tried the google download, it was working with fastai v.1. As I am thinking to use the data for longer term after the course. I also have the copyright issues.
I will try to wait and see. How other might help with other alternatives.

Thanks again.

joedockrill · August 21, 2020, 8:03am

here’s my alternative:

joedockrill · August 21, 2020, 2:56pm

(@jeremy as requested)

alternative #2 thanks to @IegorT, for those of you who prefer to use the javascript console/search results page method, the following bits of javascript should work in duckduckgo and bing.

duckduckgo:

urls=Array.from(document.querySelectorAll("img.tile--img__img")).map(el=> el.hasAttribute('data-src')?el.getAttribute('data-src'):el.getAttribute('src')).map(src => decodeURIComponent(src.split("?u=")[1]));
window.open('data:text/csv;charset=utf-8,' + escape(urls.join('\n')));

bing:

urls=Array.from(document.querySelectorAll("a.iusc")).reduce((acc, el) => el.hasAttribute("m") ? acc.concat(JSON.parse(el.getAttribute("m")).murl) : acc, [])
window.open('data:text/csv;charset=utf-8,' + escape(urls.join('\n')));

JorgeBriones · August 22, 2020, 7:44pm

Hi Jeremiah,
After the free time expired on my azure account, I signed up for the paid service. Just got to make sure to shut down services when you are done & regenerate keys. This made it simpler for me to continue to follow this course.

kakods · August 24, 2020, 5:42pm

@joedockrill Thanks for that trick. Do you have any idea if Bing TOS is different to Google?

joedockrill · August 25, 2020, 6:06am

No idea

jimmiemunyi · September 6, 2020, 9:27am

@joedockrill thanks for your solution!
question, is there a way to turn the solution to a pypi package that can just be installed with pip?

joedockrill · September 6, 2020, 9:33am

Which solution?? One is a notebook and the other is JavaScript. Either way I’m going with no.

jimmiemunyi · September 6, 2020, 9:38am

oh apologies, maybe i didn’t understand it. lemme read it again (the notebook).

i was thinking of having a similar function like the one provided by jeremy
bing_image_search(search_term)

and i saw sth like
duckduckgo_search()

in your notebook. cant we just pip install sth and use the duckduckgo_search() function directly?

joedockrill · September 6, 2020, 9:42am

Yeah I was just coming back to reply again. It’s early here and I was dealing with a whiney teenager at the same time. Yes, I don’t see why not. I’ll look into it soon.

jimmiemunyi · September 6, 2020, 9:44am

no prob!
again thanks @joedockrill for your current solution.
i’ve just tried it out, works like a charm.

false · September 21, 2020, 1:19pm

for other users looking for pip package, there’s bing-image-downloader, that does the job in 2 lines of code.

Install with:
pip install bing-image-downloader

And then:

from bing_image_downloader import downloader
downloader.download(query_string, limit=100,  output_dir='dataset', adult_filter_off=True, force_replace=False, timeout=60)

joedockrill · September 22, 2020, 3:40pm

i did actually turn mine into a package like @jimmiemunyi asked.

!pip install -q jmd_imagescraper

from pathlib import Path
root = Path().cwd()/"images"

from jmd_imagescraper.core import *
duckduckgo_search(root, "Cats", "cute kittens", max_results=100)
duckduckgo_search(root, "Dogs", "cute puppies", max_results=100)

also

from jmd_imagescraper.imagecleaner import *
display_image_cleaner(root)

mike.moloch · January 26, 2022, 12:10am

This is brilliant! Thank you for doing this for the community!