I cannot find this in the Hazelcast Jet 5.0 (or 4.x) documentation, so I hope someone can answer this here - can a reliable topic be used as an idempotent sink, for example to de-duplicate events coming from two identical unreliable sources (like a websocket). Or should I use an explicit event de-duplication as suggested at https://hazelcast.com/blog/stream-deduplication-with-hazelcast-jet/? Or is there a better way to cope with unreliable sources like websockets (I mean for the case I don't want to miss events ingested over a websocket, and there is non-zero chance that a single websocket instance might fail)?
Are Hazelcast Jet Reliable Topic Sinks idempotent? (Hazelcast fault-tolerance of a websocket source)
46 Views Asked by siddhadev At
2
There are 2 best solutions below
Related Questions in HAZELCAST
- On Premise Hazlecast Implementation Migration to Kubernetes
- Hazelcast 3.12.13 - Random disconnects in a 5 node cluster on GCP VMs
- Hazelcast Kubernetes Compatibility with Hazelcast > 5.3.1
- How should Hazelcast dynamically allocated FencedLocks be destroyed
- Hazelcast + Kotlin: How to give proper access to required Java packages?
- how to upgrade Hazelcast for java app in k8s from 4.x to 5.3.6?
- Consistency of replicated map across cluster nodes
- Hazelcast Management Center can't connect to cluster when providing custom serializer
- Use EventListener causing circular dependency with repository
- Use ConcurrentHashMap to replace HazelCast IMap with locking
- Can Hazelcast Leader Election Clustering work in OpenShift Containers without hz operators?
- Purpose of backups in Hazelcast
- Using Hazelcast for session management in Spring Non Boot Application
- Hazelcast clear()/isEmpty()/size() calls on IMaps getting stuck
- Hazelcast error - unable to store entries where MapStoreConfig is missing
Related Questions in STREAM-PROCESSING
- Running distributed applications using stream processing?
- Order Preserving Union/Merge Executor in Flink
- State Store or SQL Database?
- Using KafkaStreams TimeWindows for detecting timeout and expiry
- Apache Beam portable runner to run golang based job on flink
- Kafka streaming vs gRPC batching time series data
- Server Logs aggregation/reduction with KsqlDB
- Are checkpoints needed for a stream processed in databricks job running via continuous trigger?
- Apache flink job not printing anything to stdout and not sending to Kafka sink topic
- How to access the results from ProcessFunction?
- Good way to parallel process elements of 'stream' while keeping output in order
- Unable to perform 'maxBy' on a Flink windowed stream of type <SampleClass> by using the SampleClass's field as a parameter does not work
- Is there a suitable python library for doing stream processing with Kafka topics?
- In Kafka Streams, how do you parallelize complex operations (or sub-topologies) using multiple topics and partitions?
- Is it possable to stream images to a PDF using tesseract?
Related Questions in HAZELCAST-JET
- Reduce the DB workload by moving the Procedure to Hazelcast (reading from Map or alternatively get form DB)
- Synchronizing Hazelcast IMap with the database postgres
- Perform Left,Right and Inner Join in Hazelcast Jet in Stateless manner
- Hazelcast Jet after rollingAggregate how to get each grouped values in one go only?
- Hazelcast Jet not removing Jobs after completion?
- Can Hazelcast jet be used for batch based workload?
- what is purpose of JobConfig.addCustomClasspath() method in Hazelcastjet
- Does Hazelcast Jet remoteMap sink make a putTransient call on the remote map?
- How does Hazelcast Jet assign task-to-CPU priority?
- Hazelcast Jet Job is not able to access spring context
- Hazelcast Predicate SQL into map
- How to read and write Identified Data Serializable in Hazelcast
- Problem trying to configure an interface and connect to the cluster
- Can we read parquet file in Hazelcast jet?
- Aggregation and grouping using Hazelcast jet
Related Questions in EVENT-STREAM-PROCESSING
- Replaying event store
- Event stream disk size
- Flink SQL: Emitting an event after a certain time has passed after original event was recorded
- Server Logs aggregation/reduction with KsqlDB
- WSO2 Stream Processor (SP) Error "could not initialize carbon deployment engine"
- Kappa architecture - conceptual question about historical data processing
- Sink for user activity data stream to build Online ML model
- system design - How to update cache only after persisted to database?
- How to trigger a line of code in window based CEP
- Recommendations to store streaming events
- Stream processing alternatives
- Are Hazelcast Jet Reliable Topic Sinks idempotent? (Hazelcast fault-tolerance of a websocket source)
- Is it possible to achieve Exacly Once Semantics using a BASE-fashioned database?
- Issues connecting Snowplow with Kafka on Confluent Cloud
- How to produce messages with consecutive numbers with Kafka?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Any queue can't in general be used for de-duplication. If you offer the same item twice, it has no means to ignore such call, for that it would have to store the identifiers from the entire history, or you have to specify storage limits like in the example you linked where the TTL attribute of
filterStatefulis used.