how to load 1000 lines of a csv into elasticsearch as 1000 different documents using elasticsearch API

Question

so ive been trying to load 1000 lines of a csv into elasticsearhc as 1000 different documents, the csv has 8 headers: telease year, title, origin/ethnicity, director, cast, wiki page, plot. my current code for loading the dataset loads it using the bulk command from helpers this i think loads 1000 lines into one document. Answer You are on the

Accepted Answer

You are on the right path, the code below will split the csv into 1000 different items but splitting out the headers and turning each line item into a map/dictionary item with appropriate headers. This is then appended to a list so you upload a list of dictionary items.import csv, sysfrom elasticsearch import helpers, Elasticsearch, RequestsHttpConnectiones = Elasticsearch(    hosts=[{        'host': 'localhost',        'port': '9200'}],    use_ssl=False,    verify_certs=True,    connection_class=RequestsHttpConnection)upload_list = [] # list of items for upload# Load all csv datawith open('my_folder/my_csv_file.csv', newline='') as csvfile:        data_list = []    csv_data = csv.reader(csvfile)    for row in csv_data:        data_list.append(row)    # separate out the headers from the main data     headers = data_list[0]    # drop headers from data_list    data_list.pop(0)    for item in data_list: # iterate over each row/item in the csv        item_dict = {}        # match a column header to the row data for an item        i = 0        for header in headers:            item_dict[header] = item[i]            i = i+1        # add the transformed item/row to a list of dicts        upload_list += [item_dict]# using helper library's Bulk API to index list of Elasticsearch docstry:    resp = helpers.bulk(        es,        upload_list,        index="my-index-name"    )    msg = "helpers.bulk() RESPONSE: " + str(resp)    print(msg) # print the response returned by Elasticsearchexcept Exception as err:    msg = "Elasticsearch helpers.bulk() ERROR: " + str(err)    print(msg)    sys.exit(1)

Advertisement

Answer