Questions about ImageDataBunch.from_name_re and ImageDataBunch.from_folder

evan.xiong · October 23, 2018, 8:13am

Hi, I have some questions about loading image data with ImageDataBunch.

My data is laid out like this: ./label_names/xxx.jpg

and I set:
path = Path('./')

If I use ImageDataBunch.from_name_re, it requires the file names, which are supposed to come from
fname = get_image_files(path). However, this function does not recurse into subfolders, so fname comes back as []. Question: is there a way to make get_image_files recurse into subfolders? (A simple loop would do the job if the tree is only one level deep.)

If I use ImageDataBunch.from_folder, it requires you to rearrange your folders into ./train/label_names/xxx.jpg and ./valid/label_names/xxx.jpg. Preparing a validation set by hand is common practice, but sometimes I would like it to be generated automatically. Is there a function in the library to do this?

Thanks a lot.

jeremy · October 23, 2018, 9:54pm

You can create your file list any way you like. I’d suggest this:

https://docs.python.org/3/library/pathlib.html#pathlib.Path.glob
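
As a minimal sketch of that suggestion, `Path.rglob` walks subfolders recursively, and a regular expression can then pull the label out of each path. The helper names `list_images` and `label_of`, the extension set, and the pattern are illustrative assumptions, not part of fastai.

```python
import re
from pathlib import Path

# Assumed set of image extensions; adjust to match your data.
IMAGE_EXTENSIONS = {'.jpg', '.jpeg', '.png'}

def list_images(root):
    """Return a sorted list of image files found anywhere under root."""
    return sorted(p for p in Path(root).rglob('*')
                  if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS)

# Hypothetical pattern: capture the parent folder name as the label,
# e.g. 'label_names/xxx.jpg' -> 'label_names'.
LABEL_PATTERN = re.compile(r'([^/]+)/[^/]+$')

def label_of(path):
    """Extract the label (parent folder name) from a file path."""
    match = LABEL_PATTERN.search(Path(path).as_posix())
    return match.group(1) if match else None
```

A list built this way, together with a pattern like the one above, is the kind of input from_name_re expects, assuming the labels really are the folder names.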

Gkarmakar · January 27, 2019, 11:51pm

How can I use ImageDataBunch when my image dataset is a numpy array of shape 60000 x 784, where each of the 60000 images is a flattened vector of 784 pixels?
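
One possible first step, sketched under the assumption that each 784-value row is a row-major 28 x 28 grayscale image (784 = 28 * 28), is to reshape the array back into 2-D images. The function name `unflatten_images` and its defaults are hypothetical.

```python
import numpy as np

def unflatten_images(flat, height=28, width=28):
    """Reshape an (n, height*width) array into (n, height, width) images."""
    flat = np.asarray(flat)
    if flat.ndim != 2 or flat.shape[1] != height * width:
        raise ValueError(
            f'expected shape (n, {height * width}), got {flat.shape}')
    # Assumes pixels were flattened row by row (C order).
    return flat.reshape(-1, height, width)
```

Reshaping only restores the image layout; the images would still need to be saved to disk or wrapped in a dataset before an ImageDataBunch could consume them.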

ste · January 28, 2019, 1:11am

This is an example of glob usage:

import numpy as np

# Create a list of all files one level below path_train
# (path_train is a pathlib.Path defined earlier in the notebook)
all_files = [f for d in path_train.glob('*') for f in d.glob('*')]
np.random.shuffle(all_files)  # Shuffle in place to avoid bias from ordering
print('Files count:', len(all_files))
print('Sample:', all_files[:10])
files = all_files  # Use the full list
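
With a file list in hand, a validation set can be generated automatically by shuffling and slicing off a fraction. This is a minimal sketch in plain Python; the name `split_train_valid`, the `valid_pct` default of 0.2, and the fixed seed are illustrative assumptions rather than library behavior.

```python
import random

def split_train_valid(items, valid_pct=0.2, seed=42):
    """Shuffle a copy of items and split it into (train, valid) lists."""
    items = list(items)
    random.Random(seed).shuffle(items)  # seeded so the split is reproducible
    n_valid = int(len(items) * valid_pct)
    return items[n_valid:], items[:n_valid]
```

For example, `train_files, valid_files = split_train_valid(files)` would hold out roughly 20% of the files for validation without rearranging any folders.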

github.com

artste/fastai-samples/blob/master/kaggle/lesson1-Distracted-Driver-Detection.ipynb

Notebook preview (truncated): "Lesson 1 - Are you a distracted driver?", a Kaggle competition sample for https://www.kaggle.com/c/state-farm-distracted-driver-detection