News:

Masm32 SDK description, downloads and other helpful links
Message to All Guests
NB: Posting URL's See here: Posted URL Change

Main Menu

xxHash

Started by Vortex, January 22, 2023, 07:55:13 PM

Previous topic - Next topic

Vortex

QuotexxHash is an Extremely fast Hash algorithm, processing at RAM speed limits. Code is highly portable, and produces hashes identical across all platforms (little / big endian). The library includes the following algorithms :

XXH32 : generates 32-bit hashes, using 32-bit arithmetic
XXH64 : generates 64-bit hashes, using 64-bit arithmetic
XXH3 (since v0.8.0): generates 64 or 128-bit hashes, using vectorized arithmetic. The 128-bit variant is called XXH128.

https://github.com/Cyan4973/xxHash

Biterider

Thanks Vortex
Very useful information!  :thumbsup:
The benchmark data is very interesting.
Let's see if we can use them...

Biterider

jack

had look at the collisions test
Quote
The test requires a very large amount of memory. By default, it will generate 24 billion of 64-bit hashes, requiring 192 GB of RAM for their storage
that's a bit more memory than what my PC has  :biggrin:
one thing that has always bothered me about hash tables is collisions, how you deal with a collision?

LiaoMi

#3
:tongue: :thup:
Hash tables for ultra fast dictionaries - http://masm32.com/board/index.php?topic=9754.msg107371#msg107371

jj2007

Have you noticed that there is a big difference between searching with Google vs using Forum search?

Google finds 12 matches for "exgetsel" (with quotes) in the whole Internet.
Forum search finds 18 matches, in this forum only.

How could this be improved using hash tables?

NoCforMe

So maybe the mighty vaunted Google ain't the miraculous collection of algorithms that it's cracked up to be, eh? Who woulda thunk it?

(OTOH, it's a hell of a lot better than Duck Duck Go, which I no longer use: I love its privacy protections, but its search results are quite inferior to Google's.)
Assembly language programming should be fun. That's why I do it.

jj2007

The explanation is that Google uses fast hash tables, while forum search uses slower algos.

NoCforMe

So the tortoise wins the race ...
Assembly language programming should be fun. That's why I do it.

jj2007

Yep. Google uses hash tables, and therefore can find only full words. No partial search :cool:

mineiro

#9
You can check this link, sound that authors are talking about:
https://encode.su/threads/2556-Improving-xxHash
I'd rather be this ambulant metamorphosis than to have that old opinion about everything

jj2007

Quote from: mineiro on January 23, 2023, 12:27:35 PM
You can check this link, sound that authors are talking about:
https://encode.su/threads/2556-Improving-xxHash

Bulat Ziganshin definitely knows what he is talking about, he is the author of FreeArc, the best archiver ever :thumbsup: