I have below json
string loaded to dataframe
. Now I want to filter the record based on ossId
.
The condition I have is giving the error message. what is the correct way to filter by ossId?
JavaScript
x
46
46
1
import pandas as pd
2
3
data = """
4
{
5
"components": [
6
{
7
"ossId": 3946,
8
"project": "OALX",
9
"licenses": [
10
{
11
"name": "BSD 3",
12
"status": "APPROVED"
13
}
14
]
15
},
16
{
17
"ossId": 3946,
18
"project": "OALX",
19
"version": "OALX.client.ALL",
20
"licenses": [
21
{
22
"name": "GNU Lesser General Public License v2.1 or later",
23
"status": "APPROVED"
24
}
25
]
26
},
27
{
28
"ossId": 2550,
29
"project": "OALX",
30
"version": "OALX.webservice.ALL" ,
31
"licenses": [
32
{
33
"name": "MIT License",
34
"status": "APPROVED"
35
}
36
]
37
}
38
]
39
}
40
"""
41
42
df = pd.read_json(data)
43
print(df)
44
45
df1 = df[df["components"]["ossId"] == 2550]
46
Advertisement
Answer
I think your issue is due to the json structure. You are actually loading into df
a single row that is the whole list of field component
.
You should instead pass to the dataframe the list of records. Something like:
JavaScript
1
5
1
json_data = json.loads(data)
2
df = pd.DataFrame(json_data["components"])
3
4
filtered_data = df[df["ossId"] == 2550]
5