Skip to content

09 Collision Handling

Last Updated: 2 years ago2023-01-25 ; Contributors: AhmedThahir

Collision Handling

eliminates collisions in hashing

Collision Search Complexity Disadvantage
Separate Chaining array with NN buckets pointing to linked lists O(n)O(n) memory wastage
Linear Probing (i+j)%N(i + j )\% N O(1)O(1) clustering
Quadratic Probing (i+j2)%N(i + j^2) \% N some elements may not be able to stored
Double Hashing

Problem with probing is the possibility of full bucket

  1. Compute i=h(k)i = h(k)

  2. Start at array cell a[i]a[i]

  3. Probe consecutive locations until

return case
present
not present empty cell

something more

Double Hashing

use a secondary hash function d(k)d(k)

Hashing

h(k)=k%Nd(k)=qk%q \begin{aligned} h(k) &= k \% N \\ d(k) &= q - k \% q \\ \end{aligned}

where

  1. qq is prime and q<Nq < N
  2. NN is the no of elements

Bucket Placement

index=(i+jd(k))%Ni=h(k)j=0,1, \begin{aligned} \text{index} = \Big( i+ jd(k) \Big) \% N \\ i &= h(k) \\ j &= 0, 1,\dots \end{aligned}

Load Factor

\[ \lambda = \frac{n}{N} \]
  • \(n\) is the no of keys
  • \(N\) is the no of buckets

Load Factor should preferably be \(\lambda < 0.75\), or atleast \(\lambda < 1\)

Rehashing

Comments