I have huge data and as a result I cannot hold all of it in memory and I always get out of memory errors; obviously one of the solutions would be using streaming in Node.JS; but streaming is not possible(as far as I know) with sorting which is one the functionalities which I apply on my data; is there any algorithm maybe Divide and conquer algorithm that I can use for the combination of streaming and sorting (which is one of the functionalities which I apply on my data?)
Streaming big data while sorting
867 Views Asked by user385729 At
1
There are 1 best solutions below
Related Questions in NODE.JS
- Using Puppeteer to scrape a public API only when the data changes
- How to request administrator rights?
- How do I link two models in mongoose?
- Variable inside a Variable, not updating
- Unable to Post Form Data to MongoDB because of picturepath
- Connection terminated unexpectedly while performing multi row insert using pg-promise
- Processing multiple forms in nodejs and postgresql
- Node.js Server + Socket.IO + Android Mobile Applicatoin XHR Polling Error...?
- How to change the Font Weight of a SelectValue component in React when a SelectItem is selected?
- My unban and ban commands arent showing when i put the slash
- how to make read only file/directory in Mac writable
- How can I outsource worker processes within a for loop?
- Get remote MKV file metadata using nodejs
- Adding google-profanity-words to web page
- Products aren't displayed after fetching data from mysql db (node.js & express)
Related Questions in ALGORITHM
- MCNP 6 - Doubts about cells
- Given partially sorted array of type x<y => first apperance of x comes before first of y, sort in average O(n)
- What is the algorithm behind math.gcd and why it is faster Euclidean algorithm?
- Purpose of last 2 while loops in the merge algorithm of merge sort sorting technique
- Dots and Boxes with apha-beta pruning
- What is the average and worst-case time complexity of my string searching algorithm?
- Building a School Schedule Generator
- TC problem 5-2:how to calculate the probability of the indicator random variable?
- LCA of a binary tree implemented in Python
- Identify the checksum algorithm
- Algorithm for finding a subset of nodes in a weighted connected graph such that the distance between any pair nodes are under a postive number?
- Creating an efficent and time-saving algorithm to find difference between greater than and lesser than combination
- Algorithm to find neighbours of point by distance with no repeats
- Asking code suggestions about data structure and algorithm
- Heap sort with multithreading
Related Questions in SORTING
- Sorting a List by its property renames all the objects in the List
- Does Sort() method in C# use recursion?
- ARM Assembly code is not executing in Vitis IDE
- Creating an efficent and time-saving algorithm to find difference between greater than and lesser than combination
- Heap sort with multithreading
- Laravel Livewire data table sorting livewire update payload
- basic MergeSort exercise
- How to import a range into a variant array in Excel VBA and sort using the sort method?
- Looker Studio | pivot chart - sorting by metric and last month
- how to create an array of multiples of 5 and display it in reverse
- matplotlib sort barh by values
- Custom Sorting Javascript with A-Z set
- Mainframe Programming Sorting, OUTFIL REMOVECC,NODETAIL
- Soft list based on another list
- SQL query : creating table with distinct values on selected columns
Related Questions in AMAZON-DYNAMODB
- S3 integration testing
- Why Scanning GSI on DynamoDb doesnt work as fast as expected when using CONTAINS?
- TypeError trying to upload a row to DynamoDB; Unsupported type "<class 'numpy.int64'>" for value "0"
- Terraform/AWS - Error while acquiring state lock
- Problems with Sets in Nested Schemas with Dynamoose
- DynamoDBIndexHashKey annotation on entity field make queries fail
- AWS DynamoDB QueryRequest multiple sort key values
- AWS insert lambda function values into DynamoDB
- How to integrate JIRA with AWS dynamodb service, If any story/task movement has been occured on jira dashabord it should reflect on the AWS dynamodb?
- Create custom IAM policy to hide S3 objects based on a catalog hosted in DynamoDB
- What is the relation between partition key and partition in dynamodb
- Enforce schema on schemaless DynamoDB database
- DynamoDB: Using SET on an UpdateExpression to increment a field is not working
- DynamoDB structure recommendation
- DynamoDB TTL Interaction When Dealing with A Single Partition Key
Related Questions in DIVIDE-AND-CONQUER
- Google interview question - check if all subarrays of an array have at least one unique element
- Find max product using divide and conqure in O(n) time
- How to compute the product of a n x n size Toeplitz matrix and a n-length vector in O(nlogn) time using FFT algorithm
- Proof of correctness for algorithm to find the median of the union of two sorted arrays
- Is there an equivalent to divide and conquer when trying to identify multiple items?
- Divide and conquer algorithm problem applied to an n x n-matrix
- How can I adjust my code to use binary search to guess the number I am thinking in my head
- XSLT recursion crashes after 1000 calls - how to transform it to DVC style?
- labeling graph edges
- Restore the original array after merge Sort based on it's steps
- binary search left and right index to find median of two sorted arrays
- Divide and Conquer max profit algorithm
- How to add `n log n` stones to a grid to form a beautiful arrangement using divide-and-conquer? - algorithm idea
- How will the stack be formed by Recursion in the MergeSort function?
- How can I find all the matches of two sorted arrays in O(n) with limitations on number of comparisons?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
You can stream the data using Kinesis and use the Kinesis Client Library, or subscribe a Lambda function to your Kinesis stream and incrementally maintain sorted materialized views. Where you store your sorted materialized views and how you divide your data will depend on your application. If you cannot store the entire sorted materialized views, you could have rolling views. If your data is time-series, or has some other natural order, you could divide the range of your ordered attribute into chunks. Then, you could have for example, 1-day or 1-hour sorted chunks of your data. In other words, choose the sorted subdivision that allows you to keep the information in memory as needed.