cost control: dedupe writeEvents #422

Closed · wants to merge 5 commits

Conversation

rebelArtists (Contributor)

Description

  • deduplicate blockEvents by unique sensorID, peerID, and hash combinations
  • add sharded mutex maps to prevent goroutine contention
  • periodically clean up the dedupe maps to prevent memory exhaustion (a rough sketch of this approach follows below)
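
For reference, a minimal sketch of what the sharded dedupe cache described above could look like; the type names (dedupeCache, dedupeShard), the seen/prune helpers, and the shard count are illustrative and not taken from this PR's diff.

import (
	"hash/fnv"
	"sync"
	"time"
)

// dedupeShard is one slice of the cache, guarded by its own mutex so that
// concurrent goroutines rarely contend on the same lock.
type dedupeShard struct {
	mu   sync.Mutex
	seen map[string]time.Time
}

// dedupeCache spreads seen-event keys across a fixed number of shards.
type dedupeCache struct {
	shards [16]dedupeShard
}

func newDedupeCache() *dedupeCache {
	c := &dedupeCache{}
	for i := range c.shards {
		c.shards[i].seen = make(map[string]time.Time)
	}
	return c
}

// seen reports whether the sensorID/peerID/hash combination has already been
// recorded, storing it with the current time if not.
func (c *dedupeCache) seen(sensorID, peerID, hash string) bool {
	key := sensorID + "|" + peerID + "|" + hash
	h := fnv.New32a()
	h.Write([]byte(key))
	s := &c.shards[h.Sum32()%uint32(len(c.shards))]

	s.mu.Lock()
	defer s.mu.Unlock()
	if _, ok := s.seen[key]; ok {
		return true
	}
	s.seen[key] = time.Now()
	return false
}

// prune drops entries older than ttl from every shard; intended to be run
// periodically (e.g. from a time.Ticker goroutine) to bound memory use.
func (c *dedupeCache) prune(ttl time.Duration) {
	cutoff := time.Now().Add(-ttl)
	for i := range c.shards {
		s := &c.shards[i]
		s.mu.Lock()
		for k, t := range s.seen {
			if t.Before(cutoff) {
				delete(s.seen, k)
			}
		}
		s.mu.Unlock()
	}
}

A writeEvents caller would then skip the datastore write whenever seen returns true.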

Testing

Smoke tested locally; no Go errors on run.
Unclear how to test event writing locally.

minhd-vu (Contributor) commented Nov 5, 2024

Hmm, we may not need this concept of sharding. Take a look at this:

type DatastoreEventKey struct {
	SensorId string
	PeerID   string
	Hash     string
}

// writeEvent writes either a block or transaction event to datastore depending
// on the provided eventKind and hashKind.
func (d *Datastore) writeEvent(ctx context.Context, peer *enode.Node, eventKind string, hash common.Hash, hashKind string, tfs time.Time) {
	dek := DatastoreEventKey{
		SensorId: d.sensorID,
		PeerID:   peer.URLv4(),
		Hash:     hash.Hex(),
	}

	data, err := json.Marshal(dek)
	if err != nil {
		log.Error().Err(err).Any("datastore_event_key", dek).Msg("Failed to marshal datastore event key")
		return
	}

	checksum := sha256.Sum256(data)
	// We generate the name key from the hash of the sensor ID, peer ID, and
	// event hash.
	key := datastore.NameKey(eventKind, hex.EncodeToString(checksum[:]), nil)

	event := DatastoreEvent{
		SensorId: d.sensorID,
		PeerId:   peer.URLv4(),
		Hash:     datastore.NameKey(hashKind, hash.Hex(), nil),
		Time:     tfs,
		TTL:      tfs.Add(d.ttl),
	}

	_, err = d.client.RunInTransaction(ctx, func(tx *datastore.Transaction) error {
		var de DatastoreEvent
		// Check if the event already exists.
		if err := tx.Get(key, &de); err == nil {
			// Skip writing if event already exists.
			return nil
		}

		_, err := tx.Put(key, &event)
		return err
	})

	if err != nil {
		log.Error().Err(err).Msgf("Failed to write to %v", eventKind)
	}
}

This might be a simpler solution: it just checks datastore to see if the event already exists, and if it doesn't, then we write the event. This would increase the number of reads but decrease the number of writes.

I think your solution may actually be more optimal since we don't need to do reads at all, but I'm unsure of the memory footprint.

mu.Lock()
for key, timestamp := range shard {
	if timestamp.Before(expiryTime) {
		delete(shard, key) // Remove stale entry
	}
}
mu.Unlock()

minhd-vu (Contributor):
The thing I encountered before is that delete on maps doesn't actually decrease the allocated size of the map (but it does clear the key). The map retains the memory of its maximum size, which caused some memory issues in earlier sensor iterations.
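
One way around that behavior, sketched below and not part of this PR: rebuild the shard's map during cleanup instead of deleting keys in place, so the oversized backing storage of the old map can be garbage collected (shard, mu, and expiryTime follow the snippet above).

mu.Lock()
// Copy only the live entries into a fresh map, then swap it in; the old map
// (and its retained bucket memory) becomes eligible for garbage collection.
fresh := make(map[string]time.Time, len(shard))
for key, timestamp := range shard {
	if !timestamp.Before(expiryTime) {
		fresh[key] = timestamp
	}
}
shard = fresh // reassign wherever the shard map is actually stored
mu.Unlock()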

rebelArtists (Contributor, Author):

Interesting, thanks for pointing that out, will reexamine.

rebelArtists (Contributor, Author):

adjusted per your feedback

rebelArtists (Contributor, Author) commented Nov 6, 2024

This might be a simpler solution: it just checks datastore to see if the event already exists, and if it doesn't, then we write the event. This would increase the number of reads but decrease the number of writes.

I think your solution may actually be more optimal since we don't need to do reads at all, but I'm unsure of the memory footprint.

Thanks @minhd-vu for the suggestion; it's definitely a valuable approach worth considering. However, given the high concurrency and low-latency requirements here, I believe in-memory deduplication is more suitable. By avoiding additional reads to check for duplicates in Datastore, we eliminate potential latency bottlenecks and reduce dependency on external calls, which is critical for our ingestion to remain fast under heavy load. Additionally, frequent reads would likely increase our data egress costs from Datastore to sensor nodes, impacting our overall efficiency and budget. While the in-memory approach has a higher memory footprint, we'll be careful with memory cleanup logic to keep resource usage efficient and avoid potential issues.

rebelArtists requested a review from minhd-vu November 6, 2024 15:33
minhd-vu (Contributor) commented Nov 7, 2024

@rebelArtists makes sense, Dan. For an in-memory solution I would propose storing a linked list in the conn struct in the protocol.go file. A new conn struct is made for each peer, and sensors do not open duplicate connections, so anything stored there will be unique to the sensor-peer combination. If you had a linked list that stored the block hashes we've seen along with their times, you could avoid the mutexes, sharding, etc.

// conn represents an individual connection with a peer.
type conn struct {
	sensorID  string
	node      *enode.Node
	logger    zerolog.Logger
	rw        ethp2p.MsgReadWriter
	db        database.Database
	head      *HeadBlock
	headMutex *sync.RWMutex
	counter   *prometheus.CounterVec
	name      string

	// requests is used to store the request ID and the block hash. This is used
	// when fetching block bodies because the eth protocol block bodies do not
	// contain information about the block hash.
	requests   *list.List
	requestNum uint64

+       blockHashes *list.List

	// oldestBlock stores the first block the sensor has seen so when fetching
	// parent blocks, it does not request blocks older than this.
	oldestBlock *types.Header
}

Let me know your thoughts
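
A minimal sketch (not part of this PR, nor of #423) of how such a per-connection list might be used for dedupe; the seenBlock type, the seenRecently helper, and the pruning window are illustrative, and blockHashes is assumed to be initialized with list.New() when the conn is created.

import (
	"container/list"
	"time"

	"github.com/ethereum/go-ethereum/common"
)

// seenBlock pairs a block hash with the time it was first observed on this
// connection.
type seenBlock struct {
	hash common.Hash
	time time.Time
}

// seenRecently reports whether hash was already seen on this connection within
// window, recording it if not. Since each conn serves a single peer, no mutex
// is needed.
func (c *conn) seenRecently(hash common.Hash, window time.Duration) bool {
	now := time.Now()

	// Drop any expired entries; the list is appended in arrival order, so the
	// oldest entries sit at the front.
	for e := c.blockHashes.Front(); e != nil; {
		next := e.Next()
		if now.Sub(e.Value.(seenBlock).time) > window {
			c.blockHashes.Remove(e)
		}
		e = next
	}

	for e := c.blockHashes.Front(); e != nil; e = e.Next() {
		if e.Value.(seenBlock).hash == hash {
			return true
		}
	}

	c.blockHashes.PushBack(seenBlock{hash: hash, time: now})
	return false
}

The block-handling code in protocol.go could then check seenRecently before writing an event, skipping duplicate announcements without any shared state.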

rebelArtists (Contributor, Author)

If you had a linked list that stored the block hashes we've seen along with their times, you could avoid the mutexes, sharding, etc.
Let me know your thoughts

It's a good thought, and would keep memory requirements lower. I will explore this direction today.

rebelArtists (Contributor, Author)

@minhd-vu new draft PR with a first stab at the linked-list implementation: https://github.com/0xPolygon/polygon-cli/pull/423/files

Curious about your thoughts when you get a chance.

minhd-vu (Contributor)

Closing this in favor of #423

minhd-vu closed this Nov 13, 2024
minhd-vu deleted the dan/dedupe_sensor_data branch November 13, 2024 15:22