how i can enter a path and create hdf5 file with all csv file in the same path?

111 Views Asked by At

I tried this script but I am not able to see the hdf5 file, the script running without errors but I don't see any thing.

import glob
import os
import pandas as pd

# inputs
path = input('Insert the directory path:')
group = input('Insert a group name: ')

# create a list of file paths
file_list = [file for file in glob.glob(path)]
# dict comprehension to create keys from file name and values from the csv files
dfs = {os.path.basename(os.path.normpath(filename)).split('.')[0]: pd.read_csv(filename) for filename in file_list}

# loop though the dataframes
for k,df in dfs.items():
    # store the HDF5 file
    store = pd.HDFStore('test.h5')
    # append df to a group and assign the key with f-strings
    store.append(f'{group}/{k}', df, format='table', data_columns=df.columns)
    # close the file
    store.close()
3

There are 3 best solutions below

2
On

Try adding the path to store:

store = pd.HDFStore('test.h5', mode='w')
0
On
import glob
import os
import pandas as pd

# inputs
pattern = input('Insert the file pattern:')
group = input('Insert a group name: ')

# create a list of file paths
# file_list = [file for file in glob.iglob(path)]
# pattern exp:d:\test\*.csv
file_list = list(glob.iglob(pattern))
# dict comprehension to create keys from file name and values from the csv files
dfs = {os.path.basename(os.path.normpath(filename)).split('.')[0]: pd.read_csv(filename) for filename in file_list}

# store the HDF5 file
store = pd.HDFStore('test.h5')
# loop though the dataframes
for k,df in dfs.items():
    # append df to a group and assign the key with f-strings
    store.append(f'{group}/{k}', df, format='table', data_columns=df.columns)
    # close the file
store.close()
2
On

You have to define your path to read all csv files within. At the end of your path string, just add \*.csv, like this:

C:\Users\foo\bar\*.csv