With a pandas DataFrame df:
                     x         y
0  29-04-2014 07:40:52  07:40:52
1  29-04-2014 07:59:25  07:15:00
2                  NaN       NaN
3  29-04-2014 08:16:12  07:45:00
4  09-05-2014 08:19:14  07:30:00
5  23-04-2014 08:27:16  08:30:00
6  29-04-2014 08:37:16  07:00:00
7                  NaN       NaN
8  29-04-2014 08:41:16  07:30:00
9  25-03-2014 08:42:16  07:30:00
where column x contains the Date-Month-Year Hour:Minute:Second and column y stores the Hour:Minute:Second, how can one extract only:
- the day [29 from row 0] of column x,
- the month [04 or April from row 0] of column x,
- the month and day [29-04 or 29-April from row 0] of column x,
- the hour and minute [07-40 from row 0] of columns x and y?
I imported the DataFrame from a text file using
df = pd.read_table(r"C:\data.txt", sep='\t')
but the data originally came from MS Excel or MS Access.
When I run df.dtypes, I get:
x object
y object
dtype: object
I am working with pandas version 0.14.1 in Python 3.4.
Example DataFrame
import numpy as np
import pandas as pd
df = pd.DataFrame({'x': ['29-04-2014 07:40:52', np.nan, '29-04-2014 08:16:16','29-04-2014 08:19:56', '29-04-2014 08:27:20'],
'y': ['07:40:52', '07:15:00', np.nan, '07:45:00', '07:30:00']})
I think the pandas way is to make x your index; then you can use some simple methods to extract what you want. The non-pandas way is to use the datetime module.
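For the non-pandas route, a minimal sketch with the standard library might look like the following. It parses a single string taken from row 0 of the question; the variable names are only illustrative.

```python
from datetime import datetime

# Parse one value from column x using its day-month-year layout.
stamp = datetime.strptime('29-04-2014 07:40:52', '%d-%m-%Y %H:%M:%S')

day = stamp.day                        # 29
month = stamp.month                    # 4
day_month = stamp.strftime('%d-%m')    # '29-04'
hour_minute = stamp.strftime('%H-%M')  # '07-40'
```

This works one value at a time, so missing values (NaN) would need to be skipped before calling strptime.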
The pandas way: for background, you can read the documentation on time series data, which is quite good.
Set up some example data:
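One possible setup, sketched here with values modelled on the question, is to parse x with an explicit format and then use it as the index. Dropping the rows with a missing timestamp is an assumption made for this sketch; you may prefer to keep them.

```python
import numpy as np
import pandas as pd

# Example data modelled on the question (values are illustrative).
df = pd.DataFrame({
    'x': ['29-04-2014 07:40:52', np.nan, '29-04-2014 08:16:16',
          '09-05-2014 08:19:56', '23-04-2014 08:27:20'],
    'y': ['07:40:52', '07:15:00', np.nan, '07:45:00', '07:30:00'],
})

# Parse x with an explicit day-month-year format; missing values become NaT.
df['x'] = pd.to_datetime(df['x'], format='%d-%m-%Y %H:%M:%S')

# Drop rows with no timestamp, then make x the index.
df = df.dropna(subset=['x']).set_index('x')
```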
Note that your data does not appear to have x set as the index. That is an important step.
After you have your dates as timestamps, you can access what you are after:
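As a rough sketch of that access, the block below builds a tiny stand-in frame with x already parsed and set as the index, then pulls out each piece the question asks for. It assumes a more recent pandas than the 0.14.1 mentioned in the question, since `DatetimeIndex.strftime` and the `.dt` accessor were added in later releases.

```python
import pandas as pd

# Small stand-in for the question's data, with x parsed and used as the index.
idx = pd.to_datetime(['29-04-2014 07:40:52', '09-05-2014 08:19:14'],
                     format='%d-%m-%Y %H:%M:%S')
df = pd.DataFrame({'y': ['07:40:52', '07:30:00']}, index=idx)

days = df.index.day                       # day of month: 29, 9
months = df.index.month                   # month number: 4, 5
month_names = df.index.strftime('%B')     # full month name (locale-dependent)
day_month = df.index.strftime('%d-%m')    # '29-04', '09-05'
hour_minute = df.index.strftime('%H-%M')  # '07-40', '08-19'

# y holds only a time of day, so parse it with a time-only format first.
y_hour_minute = pd.to_datetime(df['y'], format='%H:%M:%S').dt.strftime('%H-%M')
```

Using `'%d-%B'` in place of `'%d-%m'` gives the 29-April style instead.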