文章目录
  1. 1. 私有变量解析
  2. 2. 构造函数
  3. 3. hash散列
  4. 4. put方法
  5. 5. remove方法
  6. 6. get和containsKey方法
  7. 7. keySet和values集合
  8. 8. HashMap的复杂度
  9. 9. 总结

HashMapJava编程中最常用的容器类之一,它是通过数组+链表实现的,在面试时常常被问实现原理以及与HashTable作比较!

       HashMap是可以存储键值key-value对的容器,他是基于数组+链表的方式实现的,在遇到数组容量不够时也会执行扩容操作,HashMap具有插入、删除操作几乎为O(1)的效率,但是在迭代器中遍历取值时无法保序,接下来文本将从HashMap的构造、put、remove、iterator等方向进行源码的学习。

私有变量解析

       在HashMap的源码中定义的私有变量比ArrayList多很多

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75

/**
* The default initial capacity - MUST be a power of two.
*/

static final int DEFAULT_INITIAL_CAPACITY = 1 << 4; // aka 16

/**
* The maximum capacity, used if a higher value is implicitly specified
* by either of the constructors with arguments.
* MUST be a power of two <= 1<<30.
*/

static final int MAXIMUM_CAPACITY = 1 << 30;

/**
* The load factor used when none specified in constructor.
*/

static final float DEFAULT_LOAD_FACTOR = 0.75f;

/**
* An empty table instance to share when the table is not inflated.
*/

static final Entry<?,?>[] EMPTY_TABLE = {};

/**
* The table, resized as necessary. Length MUST Always be a power of two.
*/

transient Entry<K,V>[] table = (Entry<K,V>[]) EMPTY_TABLE;//HashMap的实际存储容器,table的容量一定要是2的幂次方

/**
* The number of key-value mappings contained in this map.
*/

transient int size;//当前的实际存储大小

/**
* The next size value at which to resize (capacity * load factor).
* @serial
*/

// If table == EMPTY_TABLE then this is the initial capacity at which the
// table will be created when inflated.
int threshold;//当前存储阈值

/**
* The load factor for the hash table.
*
* @serial
*/

final float loadFactor;//负载因子

/**
* The number of times this HashMap has been structurally modified
* Structural modifications are those that change the number of mappings in
* the HashMap or otherwise modify its internal structure (e.g.,
* rehash). This field is used to make iterators on Collection-views of
* the HashMap fail-fast. (See ConcurrentModificationException).
*/

transient int modCount;//修改次数

/**
* The default threshold of map capacity above which alternative hashing is
* used for String keys. Alternative hashing reduces the incidence of
* collisions due to weak hash code calculation for String keys.
* <p/>
* This value may be overridden by defining the system property
* {@code jdk.map.althashing.threshold}. A property value of {@code 1}
* forces alternative hashing to be used at all times whereas
* {@code -1} value ensures that alternative hashing is never used.
*/

static final int ALTERNATIVE_HASHING_THRESHOLD_DEFAULT = Integer.MAX_VALUE;

/**
* A randomizing value associated with this instance that is applied to
* hash code of keys to make hash collisions harder to find. If 0 then
* alternative hashing is disabled.
*/

transient int hashSeed = 0;//Hash种子,当为0时替代的hash方案将会被禁用

       咱们现在来讲解几个重要的变量:

  • DEFAULT_INITIAL_CAPACITY :HashMap的初始化容量默认为16
  • table:HashMap的实际存储容器,可以看出来他是存储在一个键值对的数组中
  • size :HashMap中存储的实际容量
  • thresholdHashMap中当前存储的容量阈值,可在构造函数中初始化,后期在扩容时:threshold=(int) Math.min(capacity * loadFactor, MAXIMUM_CAPACITY + 1)
  • loadFactor:负载因子,当loadFactor取的比较小时,可以降低Hash冲突,当时会浪费table的存储空间,但是loadFactor比较大时,会增加Hash冲突概率,导致降低put,remove,get的效率

       再来看源码中非常重要的一个类

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
static class Entry<K,V> implements Map.Entry<K,V> {
final K key;//键值不可变
V value;
Entry<K,V> next;//实现指针
int hash;

/**
* Creates new entry.
*/

Entry(int h, K k, V v, Entry<K,V> n) {
value = v;
next = n;
key = k;
hash = h;
}

public final K getKey() {
return key;
}

public final V getValue() {
return value;
}

public final V setValue(V newValue) {
V oldValue = value;
value = newValue;
return oldValue;
}

public final boolean equals(Object o) {
if (!(o instanceof Map.Entry))
return false;
Map.Entry e = (Map.Entry)o;
Object k1 = getKey();
Object k2 = e.getKey();
if (k1 == k2 || (k1 != null && k1.equals(k2))) {
Object v1 = getValue();
Object v2 = e.getValue();
if (v1 == v2 || (v1 != null && v1.equals(v2)))
return true;
}
return false;
}

public final int hashCode() {
return Objects.hashCode(getKey()) ^ Objects.hashCode(getValue());
}

public final String toString() {
return getKey() + "=" + getValue();
}

/**
* This method is invoked whenever the value in an entry is
* overwritten by an invocation of put(k,v) for a key k that's already
* in the HashMap.
*/

void recordAccess(HashMap<K,V> m) {
}

/**
* This method is invoked whenever the entry is
* removed from the table.
*/

void recordRemoval(HashMap<K,V> m) {
}
}

       HashMap.EntryHashMap存储中的核心类,也是存储数组的类型,它从Map.Entry继承而来,HashMap.Entry实际上是一个Key-Value格式存储的实体,也正是因此HashMap支持Key-Value格式的存储,细心的小伙伴可以发现:

  • final K key:他的键值使用final来修饰,也就是说HashMap存储中的键值是不可变的
  • Entry next:他有一个next指针,用于实现HashMap.Entry的列表,HashMap当遇到Hash值冲突时使用建立链表来存储数据

       根据上述我们可以知道HashMap的存储结构图如下:

  • HashMap使用Entry类型的数组进行存储
  • EntryHash值冲突时,将已存在的相应桶号(hash值)上的Entry后添加链表的方式进行存储

构造函数

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
/**
* Constructs an empty <tt>HashMap</tt> with the specified initial
* capacity and load factor.
*
* @param initialCapacity the initial capacity
* @param loadFactor the load factor
* @throws IllegalArgumentException if the initial capacity is negative
* or the load factor is nonpositive
*/

public HashMap(int initialCapacity, float loadFactor) {//指定容量和负载因子
if (initialCapacity < 0)
throw new IllegalArgumentException("Illegal initial capacity: " +
initialCapacity);
if (initialCapacity > MAXIMUM_CAPACITY)
initialCapacity = MAXIMUM_CAPACITY;
if (loadFactor <= 0 || Float.isNaN(loadFactor))
throw new IllegalArgumentException("Illegal load factor: " +
loadFactor);

this.loadFactor = loadFactor;
threshold = initialCapacity;
init();
}

/**
* Constructs an empty <tt>HashMap</tt> with the specified initial
* capacity and the default load factor (0.75).
*
* @param initialCapacity the initial capacity.
* @throws IllegalArgumentException if the initial capacity is negative.
*/

public HashMap(int initialCapacity) {//指定容量
this(initialCapacity, DEFAULT_LOAD_FACTOR);
}

/**
* Constructs an empty <tt>HashMap</tt> with the default initial capacity
* (16) and the default load factor (0.75).
*/

public HashMap() {
this(DEFAULT_INITIAL_CAPACITY, DEFAULT_LOAD_FACTOR);
}

/**
* Constructs a new <tt>HashMap</tt> with the same mappings as the
* specified <tt>Map</tt>. The <tt>HashMap</tt> is created with
* default load factor (0.75) and an initial capacity sufficient to
* hold the mappings in the specified <tt>Map</tt>.
*
* @param m the map whose mappings are to be placed in this map
* @throws NullPointerException if the specified map is null
*/

public HashMap(Map<? extends K, ? extends V> m) {//直接支持map容器实体的构造
this(Math.max((int) (m.size() / DEFAULT_LOAD_FACTOR) + 1,
DEFAULT_INITIAL_CAPACITY), DEFAULT_LOAD_FACTOR);
inflateTable(threshold);

putAllForCreate(m);
}

       中除了无参的构造函数之外,还支持:

  • 指定初始容量和负载因子
  • 制定初始容量
  • 直接使用map容器实体来构造

这三种构造函数,其中的指定初始容量和负载因子均不能小于等于0,并且负载因子必须是float类型的

hash散列

HashMap通过自定义的hash函数希望将各个key均匀得分散到各个桶中

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
/**
* Retrieve object hash code and applies a supplemental hash function to the
* result hash, which defends against poor quality hash functions. This is
* critical because HashMap uses power-of-two length hash tables, that
* otherwise encounter collisions for hashCodes that do not differ
* in lower bits. Note: Null keys always map to hash 0, thus index 0.
*/

final int hash(Object k) {//将key值进行hash
int h = hashSeed;
if (0 != h && k instanceof String) {
return sun.misc.Hashing.stringHash32((String) k);
}

h ^= k.hashCode();

// This function ensures that hashCodes that differ only by
// constant multiples at each bit position have a bounded
// number of collisions (approximately 8 at default load factor).
h ^= (h >>> 20) ^ (h >>> 12);
return h ^ (h >>> 7) ^ (h >>> 4);
}

/**
* Returns index for hash code h.
*/

static int indexFor(int h, int length) {//查找桶号
// assert Integer.bitCount(length) == 1 : "length must be a non-zero power of 2";
return h & (length-1);//获取桶号 Bucket
}

       上面hash的源码中我们可以了解到:

  • hashSeed!=0keyString类型时,将使用sun.misc.Hashing.stringHash32((String) k)进行hash,否则主要使用无符号位移位和异或操作得到hash
  • key=null的时候得到的hash值为0
  • HashMaptablelength必须为2的幂次方,在indexFor方法中通过hash值和length-1的与操作来得到桶号,保证获取的桶号小于尺寸阈值,这个桶号的是否均匀很重要,假如t为奇数,则length-1为偶数,二进制的最低位肯定为0,在与hash值进行与操作之后得到的最低位肯定也为0,则最终得到到桶号一定是偶数,这样的话indexFor方法将一定返回偶数,也就是说HashMap存储数组中的奇数索引位置都不会存储值,这会大大浪费存储空间,同时会增加hash冲突的概率,然而当length为偶数时,length-1的最低位为1,与hash值进行与操作时候得到的奇偶数概率各一半,这样就可以均匀的散列到各个桶中了

put方法

put方法是HashMap最常用的一个方法,它传递一组key-value数据往HashMap中添加,如果key存在,则更新该keyvalue,但是你知道put方法整个实现的流程吗?特别是在hash冲突时!

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
/**
* Associates the specified value with the specified key in this map.
* If the map previously contained a mapping for the key, the old
* value is replaced.
*
* @param key key with which the specified value is to be associated
* @param value value to be associated with the specified key
* @return the previous value associated with <tt>key</tt>, or
* <tt>null</tt> if there was no mapping for <tt>key</tt>.
* (A <tt>null</tt> return can also indicate that the map
* previously associated <tt>null</tt> with <tt>key</tt>.)
*/

public V put(K key, V value) {
if (table == EMPTY_TABLE) {
inflateTable(threshold);
}
if (key == null)
return putForNullKey(value);
int hash = hash(key);
int i = indexFor(hash, table.length);
for (Entry<K,V> e = table[i]; e != null; e = e.next) {
Object k;
if (e.hash == hash && ((k = e.key) == key || key.equals(k))) {//已存在key 更新值
V oldValue = e.value;
e.value = value;
e.recordAccess(this);
return oldValue;//返回被覆盖的那个值
}
}

modCount++;
addEntry(hash, key, value, i);
return null;
}

/**
* Inflates the table.//初始化新的table
*/

private void inflateTable(int toSize) {
// Find a power of 2 >= toSize
int capacity = roundUpToPowerOf2(toSize);

threshold = (int) Math.min(capacity * loadFactor, MAXIMUM_CAPACITY + 1);
table = new Entry[capacity];
initHashSeedAsNeeded(capacity);
}

/**
* Offloaded version of put for null keys
*/

private V putForNullKey(V value) {
for (Entry<K,V> e = table[0]; e != null; e = e.next) {
if (e.key == null) {//已存在key 更新值
V oldValue = e.value;
e.value = value;
e.recordAccess(this);
return oldValue;
}
}
modCount++;
addEntry(0, null, value, 0);
return null;
}

/**
* Adds a new entry with the specified key, value and hash code to
* the specified bucket. It is the responsibility of this
* method to resize the table if appropriate.
*
* Subclass overrides this to alter the behavior of put method.
*/

void addEntry(int hash, K key, V value, int bucketIndex) {
if ((size >= threshold) && (null != table[bucketIndex])) {//当size>threshold时 将会进行扩容和rehash
resize(2 * table.length);//2倍扩容
hash = (null != key) ? hash(key) : 0;
bucketIndex = indexFor(hash, table.length);
}

createEntry(hash, key, value, bucketIndex);
}

/**
* Like addEntry except that this version is used when creating entries
* as part of Map construction or "pseudo-construction" (cloning,
* deserialization). This version needn't worry about resizing the table.
*
* Subclass overrides this to alter the behavior of HashMap(Map),
* clone, and readObject.
*/

void createEntry(int hash, K key, V value, int bucketIndex) {
Entry<K,V> e = table[bucketIndex];
table[bucketIndex] = new Entry<>(hash, key, value, e);
size++;
}

/**
* Rehashes the contents of this map into a new array with a
* larger capacity. This method is called automatically when the
* number of keys in this map reaches its threshold.
*
* If current capacity is MAXIMUM_CAPACITY, this method does not
* resize the map, but sets threshold to Integer.MAX_VALUE.
* This has the effect of preventing future calls.
*
* @param newCapacity the new capacity, MUST be a power of two;
* must be greater than current capacity unless current
* capacity is MAXIMUM_CAPACITY (in which case value
* is irrelevant).
*/

void resize(int newCapacity) {
Entry[] oldTable = table;
int oldCapacity = oldTable.length;
if (oldCapacity == MAXIMUM_CAPACITY) {
threshold = Integer.MAX_VALUE;
return;
}

Entry[] newTable = new Entry[newCapacity];
transfer(newTable, initHashSeedAsNeeded(newCapacity));
table = newTable;
threshold = (int)Math.min(newCapacity * loadFactor, MAXIMUM_CAPACITY + 1);
}

//rehash
/**
* Transfers all entries from current table to newTable.
*/

void transfer(Entry[] newTable, boolean rehash) {
int newCapacity = newTable.length;
for (Entry<K,V> e : table) {
while(null != e) {
Entry<K,V> next = e.next;
if (rehash) {
e.hash = null == e.key ? 0 : hash(e.key);
}
int i = indexFor(e.hash, newCapacity);
e.next = newTable[i];
newTable[i] = e;
e = next;
}
}
}

       上面put过程中涉及到几个重要方法的说明:

  • put:HashMap对外开放的接口方法,如果已存在key的键,进行更新操作,返回oldValue,否则调用addEntry进行插入,返回null
  • inflateTable:初始化的空的table
  • putForNullKey:插入或者更新键为null的值,如果已存在null的键,进行更新操作,返回oldValue,否则调用addEntry进行插入,返回null
  • addEntry:添加Entry实体,会进行size >= threshold判断,如果成立,触发resize进行扩容,最终会调用createEntry插入key-value
  • createEntry:将key-value插入table[buctetIndex]的表头(这样做在插入的时候就不需要判断原来的链表上是否有值了,bingo)
  • resize:进行扩容操作,扩容完成之后容量为原来的2倍,同时会调用transfer进行rehash
  • transfer:根据原来的key和新的length进行rehash,得到新的table

       将上述执行流程使用图来表示之后:

remove方法

removeHashMap中最长用的方法之一,他是通过链表进行删除的

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
/**
* Removes the mapping for the specified key from this map if present.
*
* @param key key whose mapping is to be removed from the map
* @return the previous value associated with <tt>key</tt>, or
* <tt>null</tt> if there was no mapping for <tt>key</tt>.
* (A <tt>null</tt> return can also indicate that the map
* previously associated <tt>null</tt> with <tt>key</tt>.)
*/

public V remove(Object key) {
Entry<K,V> e = removeEntryForKey(key);
return (e == null ? null : e.value);
}

/**
* Removes and returns the entry associated with the specified key
* in the HashMap. Returns null if the HashMap contains no mapping
* for this key.
*/

final Entry<K,V> removeEntryForKey(Object key) {
if (size == 0) {
return null;
}
int hash = (key == null) ? 0 : hash(key);
int i = indexFor(hash, table.length);//桶号
Entry<K,V> prev = table[i];
Entry<K,V> e = prev;

while (e != null) {
Entry<K,V> next = e.next;
Object k;
if (e.hash == hash &&
((k = e.key) == key || (key != null && key.equals(k)))) {//通过equals方法进行key的查找
modCount++;
size--;
if (prev == e)
table[i] = next;
else
prev.next = next;
e.recordRemoval(this);
return e;
}
prev = e;
e = next;
}

return e;
}

       remove方法比较简单,首先是得到删除keyhash值和其对应的桶号ikey=null时其桶号为0),根据其i即可取得所在的单链表,然后循环单链表,通过equals来查找是否存在对应的key,如存在,则在单链表上删除key所对应的Entry,将删除的Entry进行返回,否则返回null

get和containsKey方法

我们常常会先执行containsKey来判断HashMap中是否存在所查询的key值,如存在再进行get,那这两个方法实现上是?

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
/**
* Returns the value to which the specified key is mapped,
* or {@code null} if this map contains no mapping for the key.
*
* <p>More formally, if this map contains a mapping from a key
* {@code k} to a value {@code v} such that {@code (key==null ? k==null :
* key.equals(k))}, then this method returns {@code v}; otherwise
* it returns {@code null}. (There can be at most one such mapping.)
*
* <p>A return value of {@code null} does not <i>necessarily</i>
* indicate that the map contains no mapping for the key; it's also
* possible that the map explicitly maps the key to {@code null}.
* The {@link #containsKey containsKey} operation may be used to
* distinguish these two cases.
*
* @see #put(Object, Object)
*/

public V get(Object key) {
if (key == null)
return getForNullKey();
Entry<K,V> entry = getEntry(key);

return null == entry ? null : entry.getValue();
}

/**
* Offloaded version of get() to look up null keys. Null keys map
* to index 0. This null case is split out into separate methods
* for the sake of performance in the two most commonly used
* operations (get and put), but incorporated with conditionals in
* others.
*/

private V getForNullKey() {//当查询的key为null的时候 需要遍历这个table[0]链表进行查找
if (size == 0) {
return null;
}
for (Entry<K,V> e = table[0]; e != null; e = e.next) {
if (e.key == null)
return e.value;
}
return null;
}

/**
* Returns <tt>true</tt> if this map contains a mapping for the
* specified key.
*
* @param key The key whose presence in this map is to be tested
* @return <tt>true</tt> if this map contains a mapping for the specified
* key.
*/

public boolean containsKey(Object key) {
return getEntry(key) != null;
}

/**
* Returns the entry associated with the specified key in the
* HashMap. Returns null if the HashMap contains no mapping
* for the key.
*/

final Entry<K,V> getEntry(Object key) {
if (size == 0) {
return null;
}

int hash = (key == null) ? 0 : hash(key);
for (Entry<K,V> e = table[indexFor(hash, table.length)];
e != null;
e = e.next) {
Object k;
if (e.hash == hash &&
((k = e.key) == key || (key != null && key.equals(k))))
return e;
}
return null;
}

       有没有惊奇的发现getcontainsKey最终都会调用getEntry方法,只是在返回值上作了不同的判断而已,getEntry方法是根据key通过遍历链表来获取对应的Entry实体。另外get在同前面一样在null上也是作了单独的处理。

keySet和values集合

在遍历HashMap取值时主要有:

  • keySet:取得键集合,迭代键集合通过get方法逐个取值
  • values:直接取得HashMapvalue集合进行遍历

下面我们看下这两个方法在源码中的实现
keySet方法的实现:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
/**
* Returns a {@link Set} view of the keys contained in this map.
* The set is backed by the map, so changes to the map are
* reflected in the set, and vice-versa. If the map is modified
* while an iteration over the set is in progress (except through
* the iterator's own <tt>remove</tt> operation), the results of
* the iteration are undefined. The set supports element removal,
* which removes the corresponding mapping from the map, via the
* <tt>Iterator.remove</tt>, <tt>Set.remove</tt>,
* <tt>removeAll</tt>, <tt>retainAll</tt>, and <tt>clear</tt>
* operations. It does not support the <tt>add</tt> or <tt>addAll</tt>
* operations.
*/

public Set<K> keySet() {
Set<K> ks = keySet;
return (ks != null ? ks : (keySet = new KeySet()));
}

private final class KeySet extends AbstractSet<K> {
public Iterator<K> iterator() {
return newKeyIterator();
}
public int size() {
return size;
}
public boolean contains(Object o) {
return containsKey(o);
}
public boolean remove(Object o) {
return HashMap.this.removeEntryForKey(o) != null;
}
public void clear() {
HashMap.this.clear();
}
}

// Subclass overrides these to alter behavior of views' iterator() method
Iterator<K> newKeyIterator() {
return new KeyIterator();
}

private final class KeyIterator extends HashIterator<K> {//最终还是基于HashIterator迭代器
public K next() {
return nextEntry().getKey();
}
}

values方法的实现:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
/**
* Returns a {@link Collection} view of the values contained in this map.
* The collection is backed by the map, so changes to the map are
* reflected in the collection, and vice-versa. If the map is
* modified while an iteration over the collection is in progress
* (except through the iterator's own <tt>remove</tt> operation),
* the results of the iteration are undefined. The collection
* supports element removal, which removes the corresponding
* mapping from the map, via the <tt>Iterator.remove</tt>,
* <tt>Collection.remove</tt>, <tt>removeAll</tt>,
* <tt>retainAll</tt> and <tt>clear</tt> operations. It does not
* support the <tt>add</tt> or <tt>addAll</tt> operations.
*/

public Collection<V> values() {
Collection<V> vs = values;
return (vs != null ? vs : (values = new Values()));
}

private final class Values extends AbstractCollection<V> {
public Iterator<V> iterator() {
return newValueIterator();
}
public int size() {
return size;
}
public boolean contains(Object o) {
return containsValue(o);
}
public void clear() {
HashMap.this.clear();
}
}

Iterator<V> newValueIterator() {
return new ValueIterator();
}

private final class ValueIterator extends HashIterator<V> {//最终还是基于HashIterator迭代器
public V next() {
return nextEntry().value;
}
}

       通过上面keySetvalues这两个方法的源码会发现他们最终都是调用HashIterator来进行实现的,只是keySet通过HashIterator迭代器之后返回的是Entrykey,而values返回的是Entryvalue

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
private abstract class HashIterator<E> implements Iterator<E> {
Entry<K,V> next; // next entry to return
int expectedModCount; // For fast-fail
int index; // current slot
Entry<K,V> current; // current entry

HashIterator() {
expectedModCount = modCount;
if (size > 0) { // advance to first entry
Entry[] t = table;
while (index < t.length && (next = t[index++]) == null)//一直循环到有值的Entry作为起始的Iter起点
;
}
}

public final boolean hasNext() {
return next != null;
}

final Entry<K,V> nextEntry() {
if (modCount != expectedModCount)
throw new ConcurrentModificationException();
Entry<K,V> e = next;
if (e == null)
throw new NoSuchElementException();

if ((next = e.next) == null) {
Entry[] t = table;
while (index < t.length && (next = t[index++]) == null)
;
}
current = e;
return e;
}

public void remove() {
if (current == null)
throw new IllegalStateException();
if (modCount != expectedModCount)
throw new ConcurrentModificationException();
Object k = current.key;
current = null;
HashMap.this.removeEntryForKey(k);
expectedModCount = modCount;
}
}

       HashIterator在实现时主要解决的时缺省问题,因为HashMaptable是根据散列码来存储的,并不是连续的,在进行迭代的时候需要隔离开这些不连续的数据,在HashIterator中是靠while (index < t.length && (next = t[index++]) == null)进行不连续缺省值的判断的。

HashMap的复杂度

       如果说HashMapget,put,remove方法的复杂度是O(1),大多数程序都应该会认同吧?其实看了上面的源码之后说O(1)的并不准确,执行这三个方法他们都要做的一件事情就是遍历桶号所在的链表,进行已有key的判断,当这个链表的长度为0或者为1时,自然的他们的复杂度都是O(1),那是在没有hash冲突的情况下,但是假如存在hash冲突,那么必然两个不同的key会散列到同一个链表中,其实他们的复杂度就不是O(1)了,所以我觉得比较安全的说法是这三个方法的复杂度近似为O(1)。
       关于HashMap的优化,就是需要减少hash冲突,越少越好,我们知道table的容量越大,hash冲突的概率会越低,但是table的利用率也会越低,所以table的容量至关重要,我们也知道size>=table.length*loadFactor时,table就会扩容量,所以我们在优化loadFactor参数很重要。

总结

  • HashMap不是线程安全的类
  • HashMap是基于数组加链表的形式实现的
  • HashMap支持null键和值
  • HashMapsize>=table.length*loadFactor时会进行2倍扩容
  • HashMapget,put,remove复杂度近似O(1)
  • loadFactor是优化的重要参数,loadFactor越大,hash冲突的概率越大,table的利用率越大,反之都越小

本作品采用[知识共享署名-非商业性使用-相同方式共享 2.5]中国大陆许可协议进行许可,我的博客欢迎复制共享,但在同时,希望保留我的署名权kubiCode,并且,不得用于商业用途。如您有任何疑问或者授权方面的协商,请给我留言

文章目录
  1. 1. 私有变量解析
  2. 2. 构造函数
  3. 3. hash散列
  4. 4. put方法
  5. 5. remove方法
  6. 6. get和containsKey方法
  7. 7. keySet和values集合
  8. 8. HashMap的复杂度
  9. 9. 总结