While working on a project, hosted locally, I'm stuck at managing CSV uploads. One of tasks require me to upload data on daily basis that has either new entries or updated status for existing entries. There is also an probability that some of the entries (that exists in database) has no updated status.
Problem statement;
I've created a CSV upload feature that uploads the CSV file to a particular location and imports the information in assigned TABLE. I want to know on what is the best way to verify the database records when I do the CSV upload.
It should ideally work as following;
- if entry doesn't exists (INSERT new entry basis data from CSV file)
- if the entry exists and has status SAME as the new uploaded CSV file (IGNORE & do nothing)
- if the entry exists and has DIFFERENT status than the one in new uploaded CSV file (UPDATE status to what is mentioned in CSV file)
Database / CSV file structure
- tracking_id (auto increment)
- odanumber (uploaded through CSV & can have duplicate entries)
- airwaybill (uploaded through CSV & UNIQUE)
- courierful (uploaded through CSV & can have duplicate entries)
- delstatus (uploaded through CSV & is what gets updated mostly)
- deliverydate (uploaded through CSV & gets updated with each delivery)
From the above, delstatus gets updated almost each time (for existing entries) the new CSV is uploaded and hence needs to be checked.
I assume that we can pick 'airwaybill' to check if it exists, and if it does, check if the delstatus is same as of CSV file or update. If 'airwaybill' doesn't exist then a new records must be added to the database. As that would save me from entering all records in database unnecessarily. Or can be done may be in a better way (that I'm yet to explore).
What's happening right now;
I'm able to upload the complete set of CSV file, creating new entries in database through following code.
<?php
if(isset($_POST['csv']))
{
$sqlname= 'localhost';
$username= 'root';
$table= 'tracking';
$password= '';
$db='aatrack';
$file=$_POST['csv'];
$cons= mysqli_connect("$sqlname", "$username","$password","$db") or die(mysql_error());
$result1=mysqli_query($cons,"select count(*) count from $table");
$r1=mysqli_fetch_array($result1);
$count1=(int)$r1['count'];
mysqli_query($cons, '
LOAD DATA LOCAL INFILE "'.$file.'"
INTO TABLE '.$table.'
FIELDS TERMINATED by \',\'
LINES TERMINATED BY \'\n\'
IGNORE 1 LINES
')or die(mysql_error());
$result2=mysqli_query($cons,"select count(*) count from $table");
$r2=mysqli_fetch_array($result2);
$count2=(int)$r2['count'];
$count=$count2-$count1;
if($count>0)
{
header("location:success.php?id=$count");
}
}
?>
Can you please help in guiding the best way possible to achieve the same. I understand that it can be done by first uploading the information to a temp_table and comparing the same before entries are updated in the LIVE table.
Please suggest an optimum way to achieve the results.
Thank you for reading this far.
Best regards,
Amit Agnihotri
How LOAD DATA INFILE works
Based on an UNIQUE index, LOAD DATA INFILE inserts a new record or updates an existing one (only if the REPLACE option is active).
(1) Regarding insert:
If the csv input value for the UNIQUE index column is NOT found in the db table, then a new record is added, with the (defined) input values from csv file.
(2) Regarding update:
If the csv input value for the UNIQUE index column is found in the db table, then the LOAD DATA INIFILE query performs the following operations (in this order!):
NB: In the rest of my answer I will speak only about the update part (2).
BEFORE INSERT-TRIGGER as solution for conditional updates
Since LOAD DATA INFILE runs an insert operation before a delete one, you can make use of the fact that the old db record still exists when the new record with the csv values is inserted. So, you can customize your new input values based on the values contained in the old record. The really cool part of this is: you can even maintain the old value of the PRIMARY KEY field.
The key is to define a BEFORE INSERT-TRIGGER in which all the needed customizations, validations and assignments reside:
Then perform the LOAD DATA INFILE query from PHP.
The codes
Create table syntax:
BEFORE INSERT-TRIGGER:
CSV file (tracking.csv)
LOAD DATA INFILE function (called from PHP)
Notes:
*) In regards of LOAD DATA INFILE, it can be that you run into the error:
It means: The LOAD DATA INFILE has no permission to read the csv file. So you must set secure-file-priv in the configuration file of your database (my.cnf, or my.ini) yourself. Like this:
*) You can NOT define a stored procedure from which to run the LOAD DATA INFILE.
In the end, there are also other solutions involving temporary tables, which, no doubt, can work perfectly. One of them is presented in this great article. So, the trigger solution is just another approach.
Good luck!