I would like to transform a JSON with the following input:
{
"039.png" : [
"finding1"
],
"006.png" : [
"finding1",
"finding2"
],
"012.png" : [
"nofinding"
]}
into a data frame with the following layout:
| image | finding1 | finding2 | nofinding |
| ------- | -------- | -------- | --------- |
| 039.png | true | false | false |
| 006.png | true | true | false |
| 012.png | false | false | true |
By now I have tried:
import pandas as pd
import json
from pandas.io.json import json_normalize
with open('file.json') as data_file:
data = json.load(data_file)
df = pd.json_normalize(data)
df = df.transpose()
df.head()
However, this results in:
039.png [finding1]
006.png [finding1, finding2]
012.png [nofinding]
In addition, I would also like to program the return path to be able to create the given JSON format from the desired table format
many thanks in advance
It's straightforward with
get_dummies
:Output: