Hi I’ve got json data that looks something like this:
JavaScript
x
24
24
1
{
2
"content": {
3
"Header 1": [
4
{
5
"name": "Name1",
6
7
}
8
},
9
{
10
"name": "Name2",
11
12
}
13
}
14
],
15
"Header 2": [
16
{
17
"name": "Name3",
18
19
}
20
}
21
],
22
}
23
}
24
I’m looking to convert this into lists that look something like this:
JavaScript
1
3
1
header1 = ["Name1", "Name2"]
2
header2 = ["Name3"]
3
So far I’ve been able to get all the names that I want using objectpath.
JavaScript
1
16
16
1
from importlib.resources import path
2
import json
3
from unicodedata import name
4
import objectpath
5
6
path = (r"C:Userspathexample.json")
7
8
with open(path) as json_file:
9
data = json.load(json_file)
10
11
tree_obj = objectpath.Tree(data)
12
13
names = list(tree_obj.execute('$..name'))
14
15
print (names)
16
But I’ve been unable to get the appropriate headers for each name as it appears to be nested under the ‘content’ header
Any help would be appreciated. Thanks :)
Advertisement
Answer
This does what you ask. Just iterate through the keys of “content”, and grab the keys in the subobjects.
JavaScript
1
30
30
1
import json
2
3
jsonx = """{
4
"content": {
5
"Header 1": [
6
{
7
"name": "Name1"
8
},
9
{
10
"name": "Name2"
11
}
12
],
13
"Header 2": [
14
{
15
"name": "Name3"
16
}
17
]
18
}
19
}"""
20
21
data = json.loads(jsonx)
22
gather = {}
23
24
for k, v in data["content"].items():
25
k1 = k.lower().replace(' ','')
26
v1 = [vv['name'] for vv in v]
27
gather[k1] = v1
28
29
print(gather)
30
Output:
JavaScript
1
2
1
{'header1': ['Name1', 'Name2'], 'header2': ['Name3']}
2
And for those who like one-liners:
JavaScript
1
4
1
gather = dict(
2
(k.lower().replace(' ',''),[vv['name'] for vv in v])
3
for k, v in data["content"].items() )
4