I have a huge DataFrame (~4 million rows), and I need to search it about a million times for a row with specific column values. Given the conditions of my problem, each query has exactly one matching row, so as soon as the search finds the first result there is no need to continue. But as far as I know, df.loc[df['column']==value] evaluates the condition on every row each time: even if the first row satisfies the search conditions, the other 4 million rows are still read and evaluated, which creates a huge overhead. Is there a way to get the first row satisfying the search conditions without reading and evaluating the rest of the rows?
How to search a pandas DataFrame for the first row satisfying a set of conditions without reading the rest of the rows?
241 Views Asked by MHDMYZ At
There is 1 best solution below
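First, set that column as the index (as you said, it has no duplicated values). Then convert your DataFrame to a dictionary and look up your value in it. Building the dictionary costs one pass over the data, but after that each query is a hash lookup instead of a scan of all 4 million rows.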
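A minimal sketch of that idea, assuming the searched column(s) really are unique. The column names `key`, `a`, `b` and `payload` and the tiny sample data are made up for illustration; they are not from the question. `to_dict(orient="index")` raises a `ValueError` if the index is not unique, so it also acts as a check on that assumption.

```python
import pandas as pd

# Hypothetical sample data; the real frame would have ~4 million rows.
df = pd.DataFrame({
    "key": [10, 20, 30, 40],
    "a": [1, 1, 2, 2],
    "b": ["x", "y", "x", "y"],
    "payload": ["p", "q", "r", "s"],
})

# Single-column condition: index by the unique column, then build
# {key: {column: value, ...}} once.
lookup = df.set_index("key").to_dict(orient="index")

row = lookup.get(30)      # dict of the remaining columns for key == 30
missing = lookup.get(99)  # None when no row matches

# Several conditions at once: index by all searched columns;
# the dictionary keys are then tuples of those values.
multi_lookup = df.set_index(["a", "b"]).to_dict(orient="index")
multi_row = multi_lookup.get((2, "x"))

print(row)
print(multi_row)
```

The dictionary holds a copy of the data as Python objects, so memory use goes up. If that is a problem, keeping the indexed DataFrame and querying it with `.loc[value]` is another option worth timing against your data.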