Skip to content

Tag: apache-beam

Read whole file in Apache Beam

Is it possible to read whole file (not line by line) in Apache Beam? For example, I want to read multiline JSONs, and my idea is to read file by file, extract data from each file and create PCollection from lists. Is it good idea or it’s better to preprocess source JSONs to one JSON file where each line…