I have a very large CSV file (over 100 million records) that I would like to delete several columns from. I have tried the application CSVed (http://csved.sjfrancke.nl/#csvuni) but a file this size will not open. Does anybody know what I would need to enter into the command line interface to delete specific columns? I am using Windows 7.
Below are the columns I currently have. I placed an "X" next to columns I would like to remove.
- domainName
- registrarName - X
- contactEmail - X
- whoisServer - X
- nameServers - X
- createdDate - X
- updatedDate - X
- expiresDate - X
- standardRegCreatedDate - X
- standardRegUpdatedDate - X
- standardRegExpiresDate - X
- status - X
- Audit_auditUpdatedDate - X
- registrant_email
- registrant_name
- registrant_organization
- registrant_street1
- registrant_street2
- registrant_street3
- registrant_street4
- registrant_city
- registrant_state
- registrant_postalCode
- registrant_country
- registrant_fax - X
- registrant_faxExt - X
- registrant_telephone
- registrant_telephoneExt
- administrativeContact_email
- administrativeContact_name
- administrativeContact_organization
- administrativeContact_street1
- administrativeContact_street2
- administrativeContact_street3
- administrativeContact_street4
- administrativeContact_city
- administrativeContact_state
- administrativeContact_postalCode
- administrativeContact_country
- administrativeContact_fax - X
- administrativeContact_faxExt - X
- administrativeContact_telephone
- administrativeContact_telephoneExt
What you need is called cut, and can get it (for example) from gnuwin32, package coreutils.
And once you have it,