Advanced Programming in the UNIX Environment, Chapter 20, A Database Library: Exercise Discussion

Source: Internet  Editor: 程序博客网  Time: 2024/06/10 21:36

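Motivation

I have finished reading the source code and more or less understand it. Now I will work through the end-of-chapter exercises myself to see whether I really do.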

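Exercise 20.1

Problem

The locking in _db_dodelete is fairly conservative. For example, waiting to lock the free list until it is really needed would allow more concurrency. What happens if the call to writew_lock is moved between the calls to _db_writedat and _db_readptr?

_db_dodelete looks like this: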

/*
 * Delete the current record specified by the DB structure.
 * This function is called by db_delete and db_store, after
 * the record has been located by _db_find_and_lock.
 */
static void
_db_dodelete(DB *db)
{
    int     i;
    char    *ptr;
    off_t   freeptr, saveptr;

    /*
     * Set data buffer and key to all blanks.
     */
    for (ptr = db->datbuf, i = 0; i < db->datlen - 1; i++)
        *ptr++ = SPACE;
    *ptr = 0;   /* null terminate for _db_writedat */
    ptr = db->idxbuf;
    while (*ptr)
        *ptr++ = SPACE;

    /*
     * We have to lock the free list.
     * zy: the deleted record is about to be moved onto the free
     * list, which modifies the free list, so only one process
     * may do this at a time.
     */
    if (writew_lock(db->idxfd, FREE_OFF, SEEK_SET, 1) < 0)
        err_dump("_db_dodelete: writew_lock error");

    /*
     * Write the data record with all blanks.
     */
    _db_writedat(db, db->datbuf, db->datoff, SEEK_SET);

    /*
     * Read the free list pointer.  Its value becomes the
     * chain ptr field of the deleted index record.  This means
     * the deleted record becomes the head of the free list.
     */
    freeptr = _db_readptr(db, FREE_OFF);

    /*
     * Save the contents of index record chain ptr,
     * before it's rewritten by _db_writeidx.
     */
    saveptr = db->ptrval;

    /*
     * Rewrite the index record.  This also rewrites the length
     * of the index record, the data offset, and the data length,
     * none of which has changed, but that's OK.
     */
    _db_writeidx(db, db->idxbuf, db->idxoff, SEEK_SET, freeptr);

    /*
     * Write the new free list pointer.
     */
    _db_writeptr(db, FREE_OFF, db->idxoff);

    /*
     * Rewrite the chain ptr that pointed to this record being
     * deleted.  Recall that _db_find_and_lock sets db->ptroff to
     * point to this chain ptr.  We set this chain ptr to the
     * contents of the deleted record's chain ptr, saveptr.
     */
    _db_writeptr(db, db->ptroff, saveptr);
    if (un_lock(db->idxfd, FREE_OFF, SEEK_SET, 1) < 0)
        err_dump("_db_dodelete: un_lock error");
}


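My answer

I don't think we can do that.

The reason is that in this statement

_db_writedat(db, db->datbuf, db->datoff, SEEK_SET);

db->datbuf has already been filled with blanks (by the library's convention, deleting a record means overwriting it with blanks). Suppose we write the blanks first and only then run the following statement, which locks the free list because we are about to modify it:

if (writew_lock(db->idxfd, FREE_OFF, SEEK_SET, 1) < 0)
    err_dump("_db_dodelete: writew_lock error");

It may happen that another process is working on the free list at that moment, so our process blocks in this call. But we have already blanked the data that is being deleted, while its index record still looks valid. Other reader processes therefore still believe the record sits normally in the hash table, read its contents as usual, and get only the wrong, all-blank data. There is no telling how long writew_lock will block, and for that whole time readers keep getting the blank contents.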
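To make the locking call in this argument concrete, here is a minimal sketch of what a blocking write lock on the free list byte amounts to. It follows the fcntl-based lock_reg pattern the book uses for writew_lock; the wrapper names lock_free_list and unlock_free_list are my own, and FREE_OFF is assumed to be 0 here.

```c
#include <fcntl.h>
#include <string.h>
#include <unistd.h>

#define FREE_OFF 0  /* assumed offset of the free list pointer in the index file */

/* Set or clear an advisory record lock on len bytes starting at offset. */
static int
lock_reg(int fd, int cmd, short type, off_t offset, short whence, off_t len)
{
    struct flock lock;

    memset(&lock, 0, sizeof(lock));
    lock.l_type = type;         /* F_RDLCK, F_WRLCK, or F_UNLCK */
    lock.l_start = offset;      /* byte offset, relative to l_whence */
    lock.l_whence = whence;     /* SEEK_SET, SEEK_CUR, or SEEK_END */
    lock.l_len = len;           /* number of bytes to lock */
    return fcntl(fd, cmd, &lock);
}

/* Hypothetical wrapper: block until we hold the write lock on the free list byte. */
int
lock_free_list(int idxfd)
{
    return lock_reg(idxfd, F_SETLKW, F_WRLCK, FREE_OFF, SEEK_SET, 1);
}

/* Hypothetical wrapper: release the lock on the free list byte. */
int
unlock_free_list(int idxfd)
{
    return lock_reg(idxfd, F_SETLK, F_UNLCK, FREE_OFF, SEEK_SET, 1);
}
```

Because the command is F_SETLKW, the caller sleeps until every conflicting lock on that byte is released, which is the open-ended wait described above.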

The book's answer

Here is the code of db_nextrec:

/*
 * Return the next sequential record.
 * We just step our way through the index file, ignoring deleted
 * records.  db_rewind must be called before this function is
 * called the first time.
 */
char *
db_nextrec(DBHANDLE h, char *key)
{
    DB      *db = h;
    char    c;
    char    *ptr;

    /*
     * We read lock the free list so that we don't read
     * a record in the middle of its being deleted.
     */
    if (readw_lock(db->idxfd, FREE_OFF, SEEK_SET, 1) < 0)
        err_dump("db_nextrec: readw_lock error");

    do {
        /*
         * Read next sequential index record.
         */
        if (_db_readidx(db, 0) < 0) {
            ptr = NULL;     /* end of index file, EOF */
            goto doreturn;
        }

        /*
         * Check if key is all blank (empty record).
         */
        ptr = db->idxbuf;
        while ((c = *ptr++) != 0 && c == SPACE)
            ;   /* skip until null byte or nonblank */
    } while (c == 0);   /* loop until a nonblank key is found */

    if (key != NULL)
        strcpy(key, db->idxbuf);    /* return key */
    ptr = _db_readdat(db);  /* return pointer to data buffer */
    db->cnt_nextrec++;

doreturn:
    if (un_lock(db->idxfd, FREE_OFF, SEEK_SET, 1) < 0)
        err_dump("db_nextrec: un_lock error");
    return(ptr);
}

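I think I did grasp the main point, but the book's answer is more precise. Note that _db_dodelete and db_nextrec both try to lock the free list: one takes a write lock, the other a read lock. Note also that, as the answer says, deleting means setting the contents to blanks and then adding the record to the free list.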

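Exercise 20.2

Problem

My answer

The analysis is similar to the previous exercise. Take two processes, one calling _db_dodelete and the other calling db_nextrec. If db_nextrec no longer locks the free list, it can read the index file freely. Suppose the process in _db_dodelete has just finished the

_db_writedat(db, db->datbuf, db->datoff, SEEK_SET);

call but has not yet changed the index record. If the process in db_nextrec comes along at that point and reads the index and the data, it gets blank data, that is, the deleted contents.

The book's answer

What the book describes is indeed one possible case.

But I believe what I described is also a correct case that can happen, unless I have misunderstood something here. Once a write lock is set, are other processes completely unable to touch the region? I think they still can: another process can ignore the locking protocol entirely and do whatever it wants.

I think this touches on the difference between mandatory and advisory locks discussed on page 366 of the book. The locks here are clearly advisory. With mandatory locks the kernel would check every read, write, and open to see whether it violates a lock, and then the case I described could not happen.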
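The advisory behaviour can be checked with a small experiment. The sketch below is my own illustration, not code from the book: the parent takes a write lock on byte 0, then a child process that never asks for the lock writes to the same byte. The function name unlocked_write_succeeds is hypothetical, and the sketch assumes an ordinary file on which mandatory locking is not enabled.

```c
#include <fcntl.h>
#include <string.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/*
 * Returns 1 if a child process managed to write byte 0 of fd while the
 * parent held a write lock on that byte, 0 if the child's write failed,
 * and -1 on a setup error.
 */
int
unlocked_write_succeeds(int fd)
{
    struct flock lock;
    pid_t pid;
    int status;

    memset(&lock, 0, sizeof(lock));
    lock.l_type = F_WRLCK;
    lock.l_whence = SEEK_SET;
    lock.l_start = 0;
    lock.l_len = 1;
    if (fcntl(fd, F_SETLK, &lock) < 0)  /* parent now holds the lock */
        return -1;

    if ((pid = fork()) < 0)
        return -1;
    if (pid == 0) {
        /* Child: fcntl locks are not inherited across fork, and the
         * child deliberately never calls fcntl before writing. */
        if (lseek(fd, 0, SEEK_SET) < 0)
            _exit(1);
        _exit(write(fd, " ", 1) == 1 ? 0 : 1);
    }
    if (waitpid(pid, &status, 0) < 0)
        return -1;

    lock.l_type = F_UNLCK;              /* release the parent's lock */
    fcntl(fd, F_SETLK, &lock);
    return WIFEXITED(status) && WEXITSTATUS(status) == 0;
}
```

With advisory locks the function is expected to return 1: the kernel only compares lock requests against each other and does not stop a plain write from a process that skips the locking protocol.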

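Exercise 20.3

Problem


My answer

These exercises really do build on one another. If we switch to mandatory locks, the kernel checks every read, write, and open to see whether it violates a lock, so every function that tries to read or write the locked region will block.

So with mandatory locks the conclusion in the problem statement no longer holds.

The book's answer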


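Exercise 20.4

Problem


My answer

What fsync does is described on page 61 of the book; you can of course also read the man page.
In short, fsync forces the modified contents sitting in the buffer cache out to disk immediately, and it blocks until that has finished before returning. The book has an important hint: fsync is meant for applications such as databases that need to be sure modified blocks have been written to disk.

That sentence pretty much gives the answer. I think fsync needs to be called whenever the contents of the database files are modified, passing it the file descriptor. Note that we have two files: the index file and the data file. So I think fsync should be called after every call to either of the following two functions, db_store and db_delete:

int       db_store(DBHANDLE, const char *, const char *, int);
int       db_delete(DBHANDLE, const char *);

/*
 * Flags for db_store().
 */
#define DB_INSERT       1    /* insert new record only */
#define DB_REPLACE      2    /* replace existing record */
#define DB_STORE        3    /* replace or insert */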

The book's answer

The book gives no answer for this exercise.

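Exercise 20.5

Problem

Roughly, the question is about writing in this order:

_db_writedat(db, data, 0, SEEK_END);
_db_writeidx(db, key, 0, SEEK_END, ptrval);

My answer

Clearly, if the index record were written first and the data record afterwards, the process could be switched out right after writing the index record. Another process might then read that index record and go fetch the data, and the data is not there yet (the read may come up empty).

Note, however, that db_store calls _db_find_and_lock, which locks the hash chain. db_fetch and db_delete also call _db_find_and_lock and try to lock the same place, so those two functions are safe. db_nextrec, on the other hand, does not take the hash chain lock, so there is a race with any caller of db_nextrec: db_nextrec reads the index record but finds no data.

The book's answer

Somehow it feels less thorough than my own analysis.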

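Exercise 20.6

Problem

My answer

I think this problem can be simplified, since the point of the experiment is just to find out whether the hash function is adequate. Reading the records through db_nextrec and then computing each record's hash value is too much trouble, so I simply run _db_hash directly on a large number of words. One criterion for judging a hash function is whether it distributes keys evenly, and that is the criterion I use here. Page 584 says that 137 is prime and therefore gives good distribution characteristics.

The experiment and the code are as follows: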

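/**
 * 1. Read every word in wordlist (27220 words in total, no duplicates).
 * 2. Treat each word as a key and compute its hash value.
 * 3. Increment the array element whose index is that hash value.
 * 4. Print the count for every hash value.
 */
#include <stdio.h>

/* I looked through the words: all are short, under 20 characters;
 * the buffer is set larger to be safe. */
#define MAXLINE 137
typedef unsigned long DBHASH;   /* hash values */
#define NHASH_DEF 137           /* hash table size */

static int hash[NHASH_DEF];

/**
 * Adapted from the _db_hash function in db.c
 */
void
_db_hash(const char *key)
{
    DBHASH  hval = 0;
    char    c;
    int     i;

    for (i = 1; (c = *key++) != 0; i++)
        hval += c * i;  /* ascii char times its 1-based index */
    hash[hval % NHASH_DEF]++;
}

int
main(int argc, char **argv)
{
    FILE *fp;
    char word[MAXLINE];
    int i, j;

    fp = fopen("wordlist.txt", "r");
    if (fp == NULL) {
        perror("wordlist.txt");
        return 1;
    }
    while (1) {
        /* fgets keeps the trailing newline and appends a '\0',
         * so the newline is hashed as part of each key */
        if (fgets(word, MAXLINE, fp) == NULL) {
            /* reached at end of file as well as on error */
            fprintf(stderr, "get word failed\n");
            break;
        }
        _db_hash(word);
    }
    fclose(fp);
    for (i = 0; i < NHASH_DEF; i++) {
        printf("%d\t%d:", i, hash[i]);
        for (j = 0; j < hash[i] / 10; j++) {
            printf(".");
        }
        printf("\n");
    }
    return 0;
}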


Results. The histogram is drawn with dots, one dot per 10 words:

0	274:...........................
1	257:.........................
2	286:............................
3	280:............................
4	288:............................
5	272:...........................
6	283:............................
7	281:............................
8	278:...........................
9	279:...........................
10	289:............................
11	282:............................
12	291:.............................
13	275:...........................
14	297:.............................
15	285:............................
16	305:..............................
17	269:..........................
18	293:.............................
19	284:............................
20	279:...........................
21	290:.............................
22	290:.............................
23	258:.........................
24	284:............................
25	287:............................
26	280:............................
27	271:...........................
28	270:...........................
29	251:.........................
30	265:..........................
31	252:.........................
32	277:...........................
33	253:.........................
34	248:........................
35	250:.........................
36	253:.........................
37	242:........................
38	248:........................
39	231:.......................
40	232:.......................
41	228:......................
42	226:......................
43	215:.....................
44	221:......................
45	208:....................
46	213:.....................
47	200:....................
48	201:....................
49	193:...................
50	212:.....................
51	180:..................
52	173:.................
53	183:..................
54	195:...................
55	161:................
56	184:..................
57	171:.................
58	168:................
59	168:................
60	178:.................
61	141:..............
62	148:..............
63	155:...............
64	146:..............
65	128:............
66	124:............
67	135:.............
68	138:.............
69	130:.............
70	115:...........
71	115:...........
72	130:.............
73	103:..........
74	111:...........
75	101:..........
76	109:..........
77	87:........
78	103:..........
79	95:.........
80	103:..........
81	99:.........
82	105:..........
83	107:..........
84	108:..........
85	108:..........
86	108:..........
87	96:.........
88	103:..........
89	98:.........
90	106:..........
91	114:...........
92	108:..........
93	105:..........
94	113:...........
95	121:............
96	122:............
97	115:...........
98	136:.............
99	146:..............
100	122:............
101	138:.............
102	154:...............
103	145:..............
104	135:.............
105	159:...............
106	157:...............
107	158:...............
108	169:................
109	181:..................
110	167:................
111	184:..................
112	187:..................
113	178:.................
114	177:.................
115	210:.....................
116	196:...................
117	210:.....................
118	210:.....................
119	210:.....................
120	220:......................
121	241:........................
122	209:....................
123	235:.......................
124	223:......................
125	240:........................
126	233:.......................
127	259:.........................
128	231:.......................
129	262:..........................
130	245:........................
131	272:...........................
132	252:.........................
133	268:..........................
134	255:.........................
135	275:...........................
136	281:............................

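At first glance the distribution looks reasonably even, since every bucket is used, although the counts actually range from 87 to 305. This is really a complicated question. I also tried 130 as the table size, simply because it is not prime, with the results below (I don't think the quality can be judged by eye, and I didn't feel like digging further, so I stopped here):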

0	281:............................
1	277:...........................
2	295:.............................
3	299:.............................
4	271:...........................
5	282:............................
6	275:...........................
7	292:.............................
8	255:.........................
9	270:...........................
10	266:..........................
11	250:.........................
12	258:.........................
13	257:.........................
14	258:.........................
15	263:..........................
16	241:........................
17	238:.......................
18	225:......................
19	224:......................
20	227:......................
21	227:......................
22	223:......................
23	210:.....................
24	206:....................
25	216:.....................
26	200:....................
27	193:...................
28	204:....................
29	184:..................
30	186:..................
31	198:...................
32	190:...................
33	166:................
34	161:................
35	163:................
36	153:...............
37	145:..............
38	132:.............
39	137:.............
40	131:.............
41	147:..............
42	137:.............
43	133:.............
44	137:.............
45	128:............
46	119:...........
47	135:.............
48	120:............
49	138:.............
50	106:..........
51	125:............
52	123:............
53	115:...........
54	119:...........
55	106:..........
56	117:...........
57	136:.............
58	105:..........
59	119:...........
60	114:...........
61	121:............
62	116:...........
63	142:..............
64	118:...........
65	125:............
66	130:.............
67	133:.............
68	140:..............
69	143:..............
70	142:..............
71	142:..............
72	154:...............
73	149:..............
74	162:................
75	165:................
76	155:...............
77	175:.................
78	170:.................
79	190:...................
80	185:..................
81	199:...................
82	176:.................
83	191:...................
84	199:...................
85	218:.....................
86	196:...................
87	217:.....................
88	217:.....................
89	223:......................
90	217:.....................
91	246:........................
92	221:......................
93	236:.......................
94	233:.......................
95	254:.........................
96	239:.......................
97	250:.........................
98	245:........................
99	249:........................
100	261:..........................
101	271:...........................
102	257:.........................
103	267:..........................
104	264:..........................
105	268:..........................
106	263:..........................
107	261:..........................
108	269:..........................
109	292:.............................
110	267:..........................
111	276:...........................
112	270:...........................
113	281:............................
114	277:...........................
115	309:..............................
116	280:............................
117	299:.............................
118	276:...........................
119	303:..............................
120	285:............................
121	297:.............................
122	295:.............................
123	305:..............................
124	285:............................
125	307:..............................
126	278:...........................
127	289:............................
128	283:............................
129	294:.............................
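Since the two histograms are hard to compare by eye, one option is to reduce each to a single number. The sketch below is my own addition, not part of the original experiment: it computes the Pearson chi-square statistic of the bucket counts against a perfectly uniform spread. The function name chi_square_uniform is hypothetical; the counts would come from an array like the hash[] array in the program above.

```c
#include <stddef.h>

/*
 * Pearson chi-square statistic of observed bucket counts against a
 * uniform expectation of total / nbuckets keys per bucket.
 * Returns 0.0 for an empty table or an empty input.
 */
double
chi_square_uniform(const unsigned long *counts, size_t nbuckets)
{
    unsigned long total = 0;
    double expected, diff, chi2 = 0.0;
    size_t i;

    if (nbuckets == 0)
        return 0.0;
    for (i = 0; i < nbuckets; i++)
        total += counts[i];
    if (total == 0)
        return 0.0;

    expected = (double)total / (double)nbuckets;
    for (i = 0; i < nbuckets; i++) {
        diff = (double)counts[i] - expected;
        chi2 += diff * diff / expected;     /* (observed - expected)^2 / expected */
    }
    return chi2;
}
```

A perfectly even table gives 0, and for keys spread uniformly at random the statistic is expected to be close to nbuckets - 1, so a value far above that suggests the buckets are unevenly loaded. Computing it for both table sizes would give a comparison that does not depend on eyeballing the dots.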


The book's answer

No answer.

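Exercise 20.7

Problem

My answer

This one is rather dull. NHASH_DEF is used in four places in total: change those four to db->nhash, and add one more parameter to db_open that is used to set db->nhash. There are a few details to watch. For example, the assignment to db->nhash from the new parameter has to happen at the very beginning, before any code that uses db->nhash.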
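A minimal sketch of the idea, under my own assumptions: struct db_sketch stands in for the real DB structure and holds only the table-size field, and db_hash_sketch is a hypothetical copy of the hash computation that takes the size from the handle instead of from NHASH_DEF.

```c
typedef unsigned long DBHASH;   /* hash values, as in db.c */

/* Hypothetical stand-in for the DB structure: only the field we need. */
struct db_sketch {
    DBHASH nhash;   /* hash table size, set once when the database is opened */
};

/* Same computation as _db_hash, but the modulus comes from the handle. */
DBHASH
db_hash_sketch(const struct db_sketch *db, const char *key)
{
    DBHASH  hval = 0;
    char    c;
    int     i;

    for (i = 1; (c = *key++) != 0; i++)
        hval += c * i;          /* ascii char times its 1-based index */
    return hval % db->nhash;
}
```

In this sketch the caller fills in nhash right after allocating the handle, which corresponds to the "assign it before anything uses it" detail above.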

The book's answer

No answer.

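Exercise 20.8

Problem

My answer

I searched for a long time and could not find an answer, not even in English. From what I have read, though, my impression is that it is not supported. NFS seems to have a mechanism of its own.
See: File locks in NFS

The book's answer

The book gives no answer.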

