[U-Boot] Hash problem...

newer
[U-Boot] [PATCH v3] Introduce a...

older
[U-Boot] Ein Mehrverdienst mit dem...

Peter Barada

17 Jan 2011 17 Jan '11

11:53 p.m.

I've stumbled across a problem in U-boot 2010.12's hashing algorithm.

In my particular case the size of the has table is 347 (due to my default environment being 2258 bytes long as reported on entry to himport_r), and I have in my envrironment the variable "ramdiskimage" as well as "preboot"; part of my startup is to do some housecleaning then delete preboot so its only done once. But when I load u-boot with an empty environemnt environment, I see the following problem:

... => printenv .... ramdiskimage=rootfs.ext2.gz.uboot .... => echo $ramdiskimage

=> setenv ramdiskimage=rootfs.ext2.gz.uboot => printenv ... ramdiskimage=rootfs.ext2.gz.uboot ramdiskimage=rootfs.ext2.gz.uboot ...

After spending an entire day digging into the hash using GDB/BDI on my ARM board, I've findally figured out that the hash key of "ramdiskimage" and "preboot" are the same modulus 347, and this is problematic because on the initial hash import, preboot is placed into the hash first (at idx 190 since the table is sorted), and then ramdiskimage collides with preboot causing the 2nd probe (at idx 191) to occur which works fine. Unfortunately as part of the housecleaning, preboot is deleted and the environemnt saved. The delete of preboot removes entry at idx 190 and the next lookup of ramdiskimage sees that idx 190 is empty and believes that the ramdiskimage is not in the table ionstead of rehashing to find it at idx 191.

The hash delete code is in error; instead of just removing the deleted key, it should instead allocate a new hashtable, hash all the keys into the new table except for the deleted key and then reclaim the old table (and deleted key).

-- Peter Barada peter.barada@logicpd.com

Show replies by date

Wolfgang Denk

19 Jan 19 Jan

9:32 a.m.

Dear Peter Barada,

In message 4D34C85E.4030408@logicpd.com you wrote:

...

After spending an entire day digging into the hash using GDB/BDI on my ARM board, I've findally figured out that the hash key of "ramdiskimage" and "preboot" are the same modulus 347, and this is problematic because on the initial hash import, preboot is placed into the hash first (at idx 190 since the table is sorted), and then ramdiskimage collides with preboot causing the 2nd probe (at idx 191) to occur which works fine. Unfortunately as part of the housecleaning, preboot is deleted and the environemnt saved. The delete of preboot removes entry at idx 190 and the next lookup of ramdiskimage sees that idx 190 is empty and believes that the ramdiskimage is not in the table ionstead of rehashing to find it at idx 191.

Thanks for debugging this.

...

The hash delete code is in error; instead of just removing the deleted key, it should instead allocate a new hashtable, hash all the keys into the new table except for the deleted key and then reclaim the old table (and deleted key).

Can you please come up with a patch?

Best regards,

Wolfgang Denk

-- DENX Software Engineering GmbH, MD: Wolfgang Denk & Detlev Zundel HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany Phone: (+49)-8142-66989-10 Fax: (+49)-8142-66989-80 Email: wd@denx.de "Wagner's music is better than it sounds." - Mark Twain

Peter Barada

5:32 p.m.

New subject: [U-Boot] [Patch] Fix hash table deletion to prevent lost entries

...

...
The hash delete code is in error; instead of just removing the deleted key, it should instead allocate a new hashtable, hash all the keys into the new table except for the deleted key and then reclaim the old table (and deleted key).

Can you please come up with a patch?

Hashtable delete is broken since freeing entry could break collision chain for other entries. Instead free key/data pair and mark entry as deleted (via negative used value) and then skip deleted entries while probing.

Break up hsearch_r into separate henter_r/hfind_r functions for readability. Recycle first deleted entry in probe chain when adding entry to table. Factor primary/secondary hash calculations into functions and use in henter_r/hfind_r/hdelete_r.

--- include/search.h | 16 ++- lib/hashtable.c | 382 ++++++++++++++++++++++++++++++++--------------------- 2 files changed, 241 insertions(+), 157 deletions(-)

diff --git a/include/search.h b/include/search.h index a7c1293..4466731 100644 --- a/include/search.h +++ b/include/search.h @@ -66,12 +66,16 @@ extern int hcreate_r(size_t __nel, struct hsearch_data *__htab); extern void hdestroy_r(struct hsearch_data *__htab);

/* - * Search for entry matching ITEM.key in internal hash table. If - * ACTION is `FIND' return found entry or signal error by returning - * NULL. If ACTION is `ENTER' replace existing data (if any) with - * ITEM.data. - * */ -extern int hsearch_r(ENTRY __item, ACTION __action, ENTRY ** __retval, + * Search for entry matching ITEM.key in internal hash table. Return + * found entry or signal error by returning NULL. + */ +extern int hfind_r(ENTRY __item, ENTRY ** __retval, + struct hsearch_data *__htab); +/* + * Add entry matching ITEM.key in internal hash table. Replace + * existing data (if any) with ITEM.data. + */ +extern int henter_r(ENTRY __item, ENTRY ** __retval, struct hsearch_data *__htab);

/* diff --git a/lib/hashtable.c b/lib/hashtable.c index 9f069c0..7b5153a 100644 --- a/lib/hashtable.c +++ b/lib/hashtable.c @@ -65,7 +65,7 @@ * which describes the current status. */ typedef struct _ENTRY { - unsigned int used; + int used; ENTRY entry; } _ENTRY;

@@ -152,7 +152,7 @@ void hdestroy_r(struct hsearch_data *htab)

/* free used memory */ for (i = 1; i <= htab->size; ++i) { - if (htab->table[i].used) { + if (htab->table[i].used > 0) { ENTRY *ep = &htab->table[i].entry;

free(ep->key); @@ -165,77 +165,20 @@ void hdestroy_r(struct hsearch_data *htab) htab->table = NULL; }

-/* - * hsearch() - */ - -/* - * This is the search function. It uses double hashing with open addressing. - * The argument item.key has to be a pointer to an zero terminated, most - * probably strings of chars. The function for generating a number of the - * strings is simple but fast. It can be replaced by a more complex function - * like ajw (see [Aho,Sethi,Ullman]) if the needs are shown. - * - * We use an trick to speed up the lookup. The table is created by hcreate - * with one more element available. This enables us to use the index zero - * special. This index will never be used because we store the first hash - * index in the field used where zero means not used. Every other value - * means used. The used field can be used as a first fast comparison for - * equality of the stored and the parameter value. This helps to prevent - * unnecessary expensive calls of strcmp. - * - * This implementation differs from the standard library version of - * this function in a number of ways: - * - * - While the standard version does not make any assumptions about - * the type of the stored data objects at all, this implementation - * works with NUL terminated strings only. - * - Instead of storing just pointers to the original objects, we - * create local copies so the caller does not need to care about the - * data any more. - * - The standard implementation does not provide a way to update an - * existing entry. This version will create a new entry or update an - * existing one when both "action == ENTER" and "item.data != NULL". - * - Instead of returning 1 on success, we return the index into the - * internal hash table, which is also guaranteed to be positive. - * This allows us direct access to the found hash table slot for - * example for functions like hdelete(). - */ - -int hmatch_r(const char *match, int last_idx, ENTRY ** retval, - struct hsearch_data *htab) -{ - unsigned int idx; - size_t key_len = strlen(match); - - for (idx = last_idx + 1; idx < htab->size; ++idx) { - if (!htab->table[idx].used) - continue; - if (!strncmp(match, htab->table[idx].entry.key, key_len)) { - *retval = &htab->table[idx].entry; - return idx; - } - } - - __set_errno(ESRCH); - *retval = NULL; - return 0; -}

-int hsearch_r(ENTRY item, ACTION action, ENTRY ** retval, - struct hsearch_data *htab) +/* Primary hash function */ +int hash_primary(ENTRY *item, struct hsearch_data *htab) { + int len = strlen(item->key); unsigned int hval; unsigned int count; - unsigned int len = strlen(item.key); - unsigned int idx;

- /* Compute an value for the given string. Perhaps use a better method. */ + /* Compute a value for the given string. Perhaps use a better method. */ hval = len; count = len; while (count-- > 0) { hval <<= 4; - hval += item.key[count]; + hval += item->key[count]; }

/* @@ -246,20 +189,95 @@ int hsearch_r(ENTRY item, ACTION action, ENTRY ** retval, if (hval == 0) ++hval;

+ return hval; +} + +/* Secondary hash function */ +int hash_secondary(unsigned int hval, struct hsearch_data *htab) +{ + unsigned int hval2; + + /* + * Second hash function: + * as suggested in [Knuth] + */ + hval2 = 1 + hval % (htab->size - 2); + return hval2; +} + +/* + * Find an entry in the hashtable. Skip negative used counts; these mark + * deleted entries. + */ +int hfind_r(ENTRY item, ENTRY ** retval, struct hsearch_data *htab) +{ + unsigned int hval; + unsigned int idx; + /* The first index tried. */ - idx = hval; + idx = hval = hash_primary(&item, htab); + + debug("hash(%s) = %d/%d; used %d\n", item.key, idx, htab->size, htab->table[idx].used); + + do { + unsigned int hval2; + + if (htab->table[idx].used == hval + && strcmp(item.key, htab->table[idx].entry.key) == 0) { + /* return found entry */ + *retval = &htab->table[idx].entry; + return idx; + } + + if (!htab->table[idx].used) { + /* end of chain, return empty */ + __set_errno(ESRCH); + *retval = NULL; + return 0; + } + + hval2 = hash_secondary(hval, htab);

- if (htab->table[idx].used) { /* - * Further action might be required according to the - * action value. + * Because SIZE is prime this guarantees to + * step through all available indices. */ - unsigned hval2; + if (idx <= hval2) + idx = htab->size + idx - hval2; + else + idx -= hval2; + + /* If we visited all entries leave the loop */ + } while (idx != hval); + __set_errno(ESRCH); + *retval = NULL; + return 0; +} + +/* + * Add/update an entry in the table; search the whole chain + * looking for the entry. Remember first index of deleted entry and + * use if found to hold new entry otherwise use empty entry at end of + * probe chain + */ +int henter_r(ENTRY item, ENTRY ** retval, struct hsearch_data *htab) +{ + unsigned int hval; + unsigned int idx; + int deleted_entry = 0; + + /* The first index tried. */ + idx = hval = hash_primary(&item, htab); + + debug("hash(%s) = %d/%d; used %d\n", item.key, idx, htab->size, htab->table[idx].used); + + do { + unsigned int hval2;

if (htab->table[idx].used == hval - && strcmp(item.key, htab->table[idx].entry.key) == 0) { + && strcmp(item.key, htab->table[idx].entry.key) == 0) { /* Overwrite existing value? */ - if ((action == ENTER) && (item.data != NULL)) { + if (item.data != NULL) { free(htab->table[idx].entry.data); htab->table[idx].entry.data = strdup(item.data); @@ -274,82 +292,122 @@ int hsearch_r(ENTRY item, ACTION action, ENTRY ** retval, return idx; }

- /* - * Second hash function: - * as suggested in [Knuth] - */ - hval2 = 1 + hval % (htab->size - 2); - - do { + if (htab->table[idx].used < 0) { + /* Remember deleted entry to use for inserution */ + if (!deleted_entry) + deleted_entry = idx + 1; + } else if (!htab->table[idx].used) { /* - * Because SIZE is prime this guarantees to - * step through all available indices. + * If table is full and another entry should be + * entered return with error. */ - if (idx <= hval2) - idx = htab->size + idx - hval2; - else - idx -= hval2; + if (htab->filled == htab->size) { + __set_errno(ENOMEM); + *retval = NULL; + return 0; + }

/* - * If we visited all entries leave the loop - * unsuccessfully. + * Create new entry; recyle deleted entry if found + * create copies of item.key and item.data */ - if (idx == hval) - break; - - /* If entry is found use it. */ - if ((htab->table[idx].used == hval) - && strcmp(item.key, htab->table[idx].entry.key) == 0) { - /* Overwrite existing value? */ - if ((action == ENTER) && (item.data != NULL)) { - free(htab->table[idx].entry.data); - htab->table[idx].entry.data = - strdup(item.data); - if (!htab->table[idx].entry.data) { - __set_errno(ENOMEM); - *retval = NULL; - return 0; - } - } - /* return found entry */ - *retval = &htab->table[idx].entry; - return idx; + if (deleted_entry) + idx = deleted_entry - 1; + + htab->table[idx].used = hval; + htab->table[idx].entry.key = strdup(item.key); + htab->table[idx].entry.data = strdup(item.data); + if (!htab->table[idx].entry.key || + !htab->table[idx].entry.data) { + __set_errno(ENOMEM); + *retval = NULL; + return 0; } - } - while (htab->table[idx].used); - }

- /* An empty bucket has been found. */ - if (action == ENTER) { - /* - * If table is full and another entry should be - * entered return with error. - */ - if (htab->filled == htab->size) { - __set_errno(ENOMEM); - *retval = NULL; - return 0; + ++htab->filled; + + /* return new entry */ + *retval = &htab->table[idx].entry; + return 1; }

+ hval2 = hash_secondary(hval, htab); + /* - * Create new entry; - * create copies of item.key and item.data + * Because SIZE is prime this guarantees to + * step through all available indices. */ - htab->table[idx].used = hval; - htab->table[idx].entry.key = strdup(item.key); - htab->table[idx].entry.data = strdup(item.data); - if (!htab->table[idx].entry.key || - !htab->table[idx].entry.data) { - __set_errno(ENOMEM); - *retval = NULL; - return 0; - } + if (idx <= hval2) + idx = htab->size + idx - hval2; + else + idx -= hval2; + + /* If we visited all entries leave the loop */ + } while (idx != hval); + __set_errno(ESRCH); + *retval = NULL; + return 0; +} + + +/* + * hsearch() + */ + +int hsearch_r(ENTRY item, ACTION action, ENTRY ** retval, + struct hsearch_data *htab) +{ + if (action == FIND) + return hfind_r(item, retval, htab); + else + return henter_r(item, retval, htab); +} + +/* + * This is the search function. It uses double hashing with open addressing. + * The argument item.key has to be a pointer to an zero terminated, most + * probably strings of chars. The function for generating a number of the + * strings is simple but fast. It can be replaced by a more complex function + * like ajw (see [Aho,Sethi,Ullman]) if the needs are shown. + * + * We use an trick to speed up the lookup. The table is created by hcreate + * with one more element available. This enables us to use the index zero + * special. This index will never be used because we store the first hash + * index in the field used where zero means not used. Every other non-negative + * value means used and a negative value means a deleted entry. The used field + * can be used as a first fast comparison for equality of the stored and the + * parameter value. This helps to prevent unnecessary expensive calls of strcmp. + * + * This implementation differs from the standard library version of + * this function in a number of ways: + * + * - While the standard version does not make any assumptions about + * the type of the stored data objects at all, this implementation + * works with NUL terminated strings only. + * - Instead of storing just pointers to the original objects, we + * create local copies so the caller does not need to care about the + * data any more. + * - The standard implementation does not provide a way to update an + * existing entry. This version will create a new entry or update an + * existing one when both "action == ENTER" and "item.data != NULL". + * - Instead of returning 1 on success, we return the index into the + * internal hash table, which is also guaranteed to be positive. + * This allows us direct access to the found hash table slot. + */

- ++htab->filled; +int hmatch_r(const char *match, int last_idx, ENTRY ** retval, + struct hsearch_data *htab) +{ + unsigned int idx; + size_t key_len = strlen(match);

- /* return new entry */ - *retval = &htab->table[idx].entry; - return 1; + for (idx = last_idx + 1; idx < htab->size; ++idx) { + if (htab->table[idx].used <= 0) + continue; + if (!strncmp(match, htab->table[idx].entry.key, key_len)) { + *retval = &htab->table[idx].entry; + return idx; + } }

__set_errno(ESRCH); @@ -357,7 +415,6 @@ int hsearch_r(ENTRY item, ACTION action, ENTRY ** retval, return 0; }

- /* * hdelete() */ @@ -365,33 +422,56 @@ int hsearch_r(ENTRY item, ACTION action, ENTRY ** retval, /* * The standard implementation of hsearch(3) does not provide any way * to delete any entries from the hash table. We extend the code to - * do that. + * do that. This is done by storing a negative value in used. */

int hdelete_r(const char *key, struct hsearch_data *htab) { - ENTRY e, *ep; - int idx; + ENTRY e; + unsigned int hval; + unsigned int idx;

debug("hdelete: DELETE key "%s"\n", key);

e.key = (char *)key;

- if ((idx = hsearch_r(e, FIND, &ep, htab)) == 0) { - __set_errno(ESRCH); - return 0; /* not found */ - } + idx = hval = hash_primary(&e, htab); + do { + unsigned int hval2;

- /* free used ENTRY */ - debug("hdelete: DELETING key "%s"\n", key); + if (htab->table[idx].used == hval + && strcmp(e.key, htab->table[idx].entry.key) == 0) { + /* Delete key/data pair */ + free(htab->table[idx].entry.key); + free(htab->table[idx].entry.data); + /* Mark entry as deleted */ + htab->table[idx].used = -1; + --htab->filled; + return 1; + }

- free(ep->key); - free(ep->data); - htab->table[idx].used = 0; + if (!htab->table[idx].used) { + /* end of chain, return empty */ + __set_errno(ESRCH); + return 0; + }

- --htab->filled; + hval2 = hash_secondary(hval, htab);

- return 1; + /* + * Because SIZE is prime this guarantees to + * step through all available indices. + */ + if (idx <= hval2) + idx = htab->size + idx - hval2; + else + idx -= hval2; + + /* If we visited all entries leave the loop */ + } while (idx != hval); + + __set_errno(ESRCH); + return 0; }

/* @@ -467,7 +547,7 @@ ssize_t hexport_r(struct hsearch_data *htab, const char sep, */ for (i = 1, n = 0, totlen = 0; i <= htab->size; ++i) {

- if (htab->table[i].used) { + if (htab->table[i].used > 0) { ENTRY *ep = &htab->table[i].entry;

list[n++] = ep; @@ -703,7 +783,7 @@ int himport_r(struct hsearch_data *htab, e.key = name; e.data = value;

- hsearch_r(e, ENTER, &rv, htab); + henter_r(e, &rv, htab); if (rv == NULL) { printf("himport_r: can't insert "%s=%s" into hash table\n", name, value);

-- 1.7.0.4 -- Peter Barada peter.barada@logicpd.com

Wolfgang Denk

9:47 p.m.

New subject: [U-Boot] [Patch] Fix hash table deletion to prevent lost entries

Dear Peter Barada,

In message 4D371208.3090801@logicpd.com you wrote:

...

...
...
The hash delete code is in error; instead of just removing the deleted key, it should instead allocate a new hashtable, hash all the keys into the new table except for the deleted key and then reclaim the old table (and deleted key).

Can you please come up with a patch?

Hashtable delete is broken since freeing entry could break collision chain for other entries. Instead free key/data pair and mark entry as deleted (via negative used value) and then skip deleted entries while probing.

Break up hsearch_r into separate henter_r/hfind_r functions for readability. Recycle first deleted entry in probe chain when adding entry to table. Factor primary/secondary hash calculations into functions and use in henter_r/hfind_r/hdelete_r.

include/search.h | 16 ++- lib/hashtable.c | 382 ++++++++++++++++++++++++++++++++--------------------- 2 files changed, 241 insertions(+), 157 deletions(-)

diff --git a/include/search.h b/include/search.h index a7c1293..4466731 100644 --- a/include/search.h +++ b/include/search.h @@ -66,12 +66,16 @@ extern int hcreate_r(size_t __nel, struct hsearch_data *__htab); extern void hdestroy_r(struct hsearch_data *__htab);

/*

Search for entry matching ITEM.key in internal hash table. If

ACTION is `FIND' return found entry or signal error by returning

NULL. If ACTION is `ENTER' replace existing data (if any) with

ITEM.data.

*/

-extern int hsearch_r(ENTRY __item, ACTION __action, ENTRY ** __retval,

Search for entry matching ITEM.key in internal hash table. Return

found entry or signal error by returning NULL.

*/

+extern int hfind_r(ENTRY __item, ENTRY ** __retval,
        struct hsearch_data *__htab);
+/*

Add entry matching ITEM.key in internal hash table. Replace

existing data (if any) with ITEM.data.

*/

+extern int henter_r(ENTRY __item, ENTRY ** __retval, struct hsearch_data *__htab);

/* diff --git a/lib/hashtable.c b/lib/hashtable.c index 9f069c0..7b5153a 100644 --- a/lib/hashtable.c +++ b/lib/hashtable.c @@ -65,7 +65,7 @@

which describes the current status.

*/ typedef struct _ENTRY {

unsigned int used;

int used; ENTRY entry; } _ENTRY;

@@ -152,7 +152,7 @@ void hdestroy_r(struct hsearch_data *htab)
  /* free used memory */
  for (i = 1; i <= htab->size; ++i) {
   if (htab->table[i].used) {
   if (htab->table[i].used > 0) {
        ENTRY *ep = &htab->table[i].entry;

        free(ep->key);
@@ -165,77 +165,20 @@ void hdestroy_r(struct hsearch_data *htab) htab->table = NULL; }

-/*

hsearch()

*/

-/*

This is the search function. It uses double hashing with open

addressing.

The argument item.key has to be a pointer to an zero terminated, most

probably strings of chars. The function for generating a number of the

strings is simple but fast. It can be replaced by a more complex

function

Your patch is line-wrapped and does not apply.

Also, insteadof imlementing just the discussed fix, it appears to me a major rewrite - you thow away lots of (useful, in my opinion) comments, debug code, etc., and even replace major parts of the code.

In the result, the code looks pretty much different from the standard C library function it was derived from.

I don't think all this was needed only to fix the observed problem?

What is your rationale for all these changes?

Best regards,

Wolfgang Denk

-- DENX Software Engineering GmbH, MD: Wolfgang Denk & Detlev Zundel HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany Phone: (+49)-8142-66989-10 Fax: (+49)-8142-66989-80 Email: wd@denx.de As far as the laws of mathematics refer to reality, they are not cer- tain, and as far as they are certain, they do not refer to reality. -- Albert Einstein

Peter Barada

20 Jan 20 Jan

4:53 p.m.

New subject: [U-Boot] [Patch v2] Fix hash table deletion to prevent lost entries

On 01/19/2011 03:47 PM, Wolfgang Denk wrote:

...

Dear Peter Barada,

In message 4D371208.3090801@logicpd.com you wrote:

...
...
...
The hash delete code is in error; instead of just removing the deleted key, it should instead allocate a new hashtable, hash all the keys into the new table except for the deleted key and then reclaim the old table (and deleted key).

Can you please come up with a patch?

From: Peter Barada peter.barada@logicpd.com Date: Thu, 20 Jan 2011 10:38:57 -0500 Subject: [PATCH] Fix hashtable to properly handle deletion.

Use negative used value to mark deleted entry. Search keeps probing past deleted entries. Adding an entry uses first deleted entry when it hits end of probe chain.

Signed-off-by: Peter Barada peter.barada@logicpd.com --- lib/hashtable.c | 18 +++++++++++++----- 1 files changed, 13 insertions(+), 5 deletions(-)

diff --git a/lib/hashtable.c b/lib/hashtable.c index 9f069c0..fcdb53c 100644 --- a/lib/hashtable.c +++ b/lib/hashtable.c @@ -65,7 +65,7 @@ * which describes the current status. */ typedef struct _ENTRY { - unsigned int used; + int used; ENTRY entry; } _ENTRY;

@@ -152,7 +152,7 @@ void hdestroy_r(struct hsearch_data *htab)

/* free used memory */ for (i = 1; i <= htab->size; ++i) { - if (htab->table[i].used) { + if (htab->table[i].used > 0) { ENTRY *ep = &htab->table[i].entry;

free(ep->key); @@ -209,7 +209,7 @@ int hmatch_r(const char *match, int last_idx, ENTRY ** retval, size_t key_len = strlen(match);

for (idx = last_idx + 1; idx < htab->size; ++idx) { - if (!htab->table[idx].used) + if (htab->table[idx].used > 0) continue; if (!strncmp(match, htab->table[idx].entry.key, key_len)) { *retval = &htab->table[idx].entry; @@ -229,6 +229,7 @@ int hsearch_r(ENTRY item, ACTION action, ENTRY ** retval, unsigned int count; unsigned int len = strlen(item.key); unsigned int idx; + unsigned int first_deleted = 0;

/* Compute an value for the given string. Perhaps use a better method. */ hval = len; @@ -256,6 +257,10 @@ int hsearch_r(ENTRY item, ACTION action, ENTRY ** retval, */ unsigned hval2;

+ if (htab->table[idx].used == -1 + && !first_deleted) + first_deleted = idx; + if (htab->table[idx].used == hval && strcmp(item.key, htab->table[idx].entry.key) == 0) { /* Overwrite existing value? */ @@ -335,6 +340,9 @@ int hsearch_r(ENTRY item, ACTION action, ENTRY ** retval, * Create new entry; * create copies of item.key and item.data */ + if (first_deleted) + idx = first_deleted; + htab->table[idx].used = hval; htab->table[idx].entry.key = strdup(item.key); htab->table[idx].entry.data = strdup(item.data); @@ -387,7 +395,7 @@ int hdelete_r(const char *key, struct hsearch_data *htab)

free(ep->key); free(ep->data); - htab->table[idx].used = 0; + htab->table[idx].used = -1;

--htab->filled;

@@ -467,7 +475,7 @@ ssize_t hexport_r(struct hsearch_data *htab, const char sep, */ for (i = 1, n = 0, totlen = 0; i <= htab->size; ++i) {

- if (htab->table[i].used) { + if (htab->table[i].used > 0) { ENTRY *ep = &htab->table[i].entry;

list[n++] = ep;

-- 1.7.0.4 -- Peter Barada peter.barada@logicpd.com

Wolfgang Denk

21 Mar 21 Mar

10:48 p.m.

New subject: [U-Boot] [Patch v2] Fix hash table deletion to prevent lost entries

Dear Peter Barada,

In message 4D385A7F.2070803@logicpd.com you wrote:

...

On 01/19/2011 03:47 PM, Wolfgang Denk wrote:

...
Dear Peter Barada,

In message 4D371208.3090801@logicpd.com you wrote:

...
...
...
The hash delete code is in error; instead of just removing the deleted key, it should instead allocate a new hashtable, hash all the keys into the new table except for the deleted key and then reclaim the old table (and deleted key).

Can you please come up with a patch?

From: Peter Barada peter.barada@logicpd.com Date: Thu, 20 Jan 2011 10:38:57 -0500 Subject: [PATCH] Fix hashtable to properly handle deletion.

Use negative used value to mark deleted entry. Search keeps probing past deleted entries. Adding an entry uses first deleted entry when it hits end of probe chain.

Signed-off-by: Peter Barada peter.barada@logicpd.com

Checkpatch generates a number of errors:

[PATCH] Fix hashtable to properly handle deletion. total: 8 errors, 16 warnings, 66 lines checked

Can you please fix these, and resubmit?

Also, do you happen to have a test case that can be used show the problem in the existing code, and to test the patch?

Thanks.

Wolfgang Denk

-- DENX Software Engineering GmbH, MD: Wolfgang Denk & Detlev Zundel HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany Phone: (+49)-8142-66989-10 Fax: (+49)-8142-66989-80 Email: wd@denx.de Any sufficiently advanced bug is indistinguishable from a feature. - Rich Kulawiec

Peter Barada

22 Mar 22 Mar

12:05 a.m.

New subject: [U-Boot] [Patch v2] Fix hash table deletion to prevent lost entries

On 03/21/2011 05:48 PM, Wolfgang Denk wrote:

...

Dear Peter Barada,

In message 4D385A7F.2070803@logicpd.com you wrote:

...
On 01/19/2011 03:47 PM, Wolfgang Denk wrote:

...
Dear Peter Barada,

In message 4D371208.3090801@logicpd.com you wrote:

...
...
...
The hash delete code is in error; instead of just removing the deleted key, it should instead allocate a new hashtable, hash all the keys into the new table except for the deleted key and then reclaim the old table (and deleted key).

Can you please come up with a patch?

From: Peter Barada peter.barada@logicpd.com Date: Thu, 20 Jan 2011 10:38:57 -0500 Subject: [PATCH] Fix hashtable to properly handle deletion.

Use negative used value to mark deleted entry. Search keeps probing past deleted entries. Adding an entry uses first deleted entry when it hits end of probe chain.

Signed-off-by: Peter Barada peter.barada@logicpd.com

Checkpatch generates a number of errors:

[PATCH] Fix hashtable to properly handle deletion. total: 8 errors, 16 warnings, 66 lines checked

Can you please fix these, and resubmit?

Updated patch attached (Thunderbird munched tabs)...

...

Also, do you happen to have a test case that can be used show the problem in the existing code, and to test the patch?

No, I don't have a testcase off hand (IIRC hashtable size is dependent on size of u-boot and amount of RAM), from my original email:

In message 4D34C85E.4030408@logicpd.com you wrote:

...

...
After spending an entire day digging into the hash using GDB/BDI on my ARM board, I've findally figured out that the hash key of "ramdiskimage" and "preboot" are the same modulus 347, and this is problematic because on the initial hash import, preboot is placed into the hash first (at idx 190 since the table is sorted), and then ramdiskimage collides with preboot causing the 2nd probe (at idx 191) to occur which works fine. Unfortunately as part of the housecleaning, preboot is deleted and the environemnt saved. The delete of preboot removes entry at idx 190 and the next lookup of ramdiskimage sees that idx 190 is empty and believes that the ramdiskimage is not in the table ionstead of rehashing to find it at idx 191.

Hope this helps...

-- Peter Barada peter.barada@logicpd.com

Wolfgang Denk

10:45 p.m.

New subject: [U-Boot] [Patch v2] Fix hash table deletion to prevent lost entries

Dear Peter Barada,

In message 4D87D9B0.1080102@logicpd.com you wrote:

...

...
Can you please fix these, and resubmit?

Updated patch attached (Thunderbird munched tabs)...

Thanks.

...

...
Also, do you happen to have a test case that can be used show the problem in the existing code, and to test the patch?

No, I don't have a testcase off hand (IIRC hashtable size is dependent on size of u-boot and amount of RAM), from my original email:

I was able to verify both the problem and that your fix fixes it. Tested on "qong".

Added a Tested-by: Wolfgang Denk wd@denx.de

...

From: Peter Barada peter.barada@logicpd.com Date: Mon, 21 Mar 2011 19:01:57 -0500 Subject: [PATCH] Fix hashtable to properly handle deletion.

Use negative used value to mark deleted entry. Search keeps probing past deleted entries. Adding an entry uses first deleted entry when it hits end of probe chain.

Initially found that "ramdiskimage" and "preboot" collide modulus 347, causing "preboot" to be inserted at idx 190, "ramdiskimage" at idx 191. Previous to this fix when "preboot" is deleted, "ramdiskimage" is orphaned.

Signed-off-by: Peter Barada peter.barada@logicpd.com

diff --git a/lib/hashtable.c b/lib/hashtable.c index 9f069c0..fcdb53c 100644 --- a/lib/hashtable.c +++ b/lib/hashtable.c

Applied, thanks.

Best regards,

Wolfgang Denk

-- DENX Software Engineering GmbH, MD: Wolfgang Denk & Detlev Zundel HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany Phone: (+49)-8142-66989-10 Fax: (+49)-8142-66989-80 Email: wd@denx.de Just because your doctor has a name for your condition doesn't mean he knows what it is.

5161

Age (days ago)

5225

Last active (days ago)

List overview

Download

7 comments

2 participants

tags (0)

participants (2)

Peter Barada
Wolfgang Denk