CSV files are actually in LRECL format

From Data-gov Wiki

Jump to: navigation, search
Infobox (Issue Report) edit with form
  • name: CSV files are actually in LRECL format

Current Issues in data.gov


"CSV" files are actually text files in LRECL format: Instead of having the fields delimited by commas, each field consists of the characters starting from one set column until a set ending column end.

The field names themselves do not appear in the data file, but are usually in some other related file, along with the field lengths and start columns. Sometimes this information comes packaged in a ZIP file along with the data, and sometimes it can be found via the metadata, technical documentation, or data dictionary variable list properties. Frequently the fields are in a PDF, though sometimes they are in an SPS or other type of file.

It is possible to parse and convert these into RDF, but it is a more difficult process because some manual work is involved in getting the field names and lengths of fields.


Related Tags

SMWiki: Category:Fixed-width-column txt
Google Spreadsheet: fixed-width (under layout) - Category:Datafile layout fixed-width

Facts about CSV files are actually in LRECL formatRDF feed
Dcterms:created24 June 2010  +
Dcterms:creatorSarah Magidson  +
Foaf:nameCSV files are actually in LRECL format
Related tagDatafile layout fixed-width  +
Skos:altLabelCSV files are actually in LRECL format  +, csv files are actually in lrecl format  +, and CSV FILES ARE ACTUALLY IN LRECL FORMAT  +
Personal tools
internal pages