A Brief Look at the Design of the phone_lookup Table in the Android Contacts Database

Please credit the source when reposting: https://blog.csdn.net/skysukai

On Android, the contacts database is a fairly large one. While browsing contacts2.db one day, I noticed that the phone_lookup table holds remarkably little data:

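Column              Meaning
data_id             foreign key to _id in the data table
raw_contact_id      foreign key to _id in the raw_contacts table
normalized_number   the normalized phone number
min_match           the normalized phone number, reversed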

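This table left me with a pile of questions. If phone_lookup is used to look up a name from a phone number, why does it store only numbers and no names? And if the goal is just to find a name, why not query the data table directly? What is this table actually for?
With these questions in mind I read through ContactsProvider2, hoping to find the answers in the code. An application looks up a name with code like this: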

Uri uri = Uri.withAppendedPath(PhoneLookup.CONTENT_FILTER_URI, Uri.encode(phoneNumber));
 resolver.query(uri, new String[]{PhoneLookup.DISPLAY_NAME,...

So let's start from the provider's query() method and find the branch that handles PHONE_LOOKUP. Abridged code:

...
String number = uri.getPathSegments().size() > 1 ? uri.getLastPathSegment() : "";
String numberE164 = PhoneNumberUtils.formatNumberToE164(number, mDbHelper.get().getCurrentCountryIso());
String normalizedNumber = PhoneNumberUtils.normalizeNumber(number);
// Set up the tables to query
mDbHelper.get().buildPhoneLookupAndContactQuery(qb, normalizedNumber, numberE164);
// Set the projection map
qb.setProjectionMap(sPhoneLookupProjectionMap);
...
final Cursor fallbackCursor = doQuery(db, qb, projectionWithNumber,
        selection, selectionArgs, sortOrder, groupBy, having, limit,
        cancellationSignal);
...
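To make the normalization step above more concrete, here is a minimal sketch of what "normalizing" a number means in practice. It is a simplified approximation written for this article, not the AOSP implementation of PhoneNumberUtils.normalizeNumber: the class name NormalizeSketch and its method are hypothetical, and the sketch only keeps digits and a leading '+', ignoring cases such as keypad letters.

```java
// Simplified approximation for illustration only; NOT the AOSP implementation.
final class NormalizeSketch {

    /** Keeps decimal digits and a leading '+'; drops spaces, dashes, brackets, etc. */
    static String normalize(String raw) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < raw.length(); i++) {
            char c = raw.charAt(i);
            if (c >= '0' && c <= '9') {
                sb.append(c);
            } else if (c == '+' && sb.length() == 0) {
                // A plus sign is only meaningful as the very first kept character.
                sb.append(c);
            }
            // Every other character is ignored.
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(normalize("+86 139-1234-5678")); // +8613912345678
        System.out.println(normalize("(139) 1234 5678"));   // 13912345678
    }
}
```

The point is only that differently formatted inputs collapse to one canonical string, which is what gets stored in normalized_number and compared in the query.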

Tracing into doQuery() gives the final SQL statement:

SELECT data1 AS number,
       contacts_view._id AS contact_id,
       contacts_view.photo_uri AS photo_uri,
       contacts_view.send_to_voicemail AS send_to_voicemail,
       data_id AS data_id,
       contacts_view.lookup AS lookup,
       contacts_view.display_name AS display_name,
       contacts_view.last_time_contacted AS last_time_contacted,
       contacts_view.has_phone_number AS has_phone_number,
       contacts_view.in_visible_group AS in_visible_group,
       contacts_view.photo_file_id AS photo_file_id,
       data3 AS label,
       contacts_view.starred AS starred,
       data4 AS normalized_number,
       contacts_view.photo_thumb_uri AS photo_thumb_uri,
       contacts_view.in_default_directory AS in_default_directory,
       contacts_view.photo_id AS photo_id,
       contacts_view.custom_ringtone AS custom_ringtone,
       contacts_view._id AS _id,
       data2 AS type,
       contacts_view.times_contacted AS times_contacted
  FROM raw_contacts
       JOIN
       view_contacts contacts_view ON (contacts_view._id = raw_contacts.contact_id),
       (
           SELECT data_id,
                  normalized_number,
                  length(normalized_number) AS len
             FROM phone_lookup
            WHERE (phone_lookup.min_match = '.......') 
       )
       AS lookup,
       data
 WHERE (lookup.data_id = data._id AND 
        data.raw_contact_id = raw_contacts._id AND 
        (lookup.normalized_number = '+86........' OR 
         lookup.len <= 11 AND 
         substr('.........', 11 - lookup.len + 1) = lookup.normalized_number OR 
         (lookup.len > 11 AND 
          substr(lookup.normalized_number, lookup.len + 1 - 11) = '........') ) ) 
 ORDER BY length(lookup.normalized_number) DESC

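As you can see, the phone_lookup result is produced by a combined query over four tables: raw_contacts, view_contacts, phone_lookup, and data. The returned rows contain number, display_name, starred, photo_thumb_uri, and the other key fields shown on the incoming-call screen.
Back to the original question: what is the phone_lookup table for? All of the data above could be queried without it.
The query above is somewhat complicated, so let's break it down. First, the part that touches phone_lookup: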

SELECT data_id,
       normalized_number,
       min_match,
       len
  FROM (
           SELECT data_id,
                  normalized_number,
                  min_match,
                  length(normalized_number) AS len
             FROM phone_lookup
       )
       AS lookup
 WHERE (lookup.normalized_number = '+86........' OR 
        lookup.len <= 11 AND 
        substr('........', 11 - lookup.len + 1) = lookup.normalized_number OR 
        (lookup.len > 11 AND 
         substr(lookup.normalized_number, lookup.len + 1 - 11) = '........') );

The result has the following columns:

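Column              Meaning
data_id             (as in phone_lookup)
normalized_number   (as in phone_lookup)
min_match           (as in phone_lookup)
len                 length of the normalized phone number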

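In short, this picks the incoming caller's number out of the phone_lookup table. As for why it is written in such a complicated way, I can only guess that phone number formats differ from country to country and that this is a way of handling them uniformly.
Returning to the first query, look at the filter conditions after WHERE: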
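The three branches of that WHERE clause can be restated in plain Java, which makes the intent easier to see: an exact match on the E.164 form, or a suffix comparison in whichever direction fits the lengths. This is a sketch of the SQL's logic only; the class PhoneLookupMatchSketch and the sample numbers are made up for illustration, and the length 11 from the SQL is generalized to the length of the incoming number.

```java
// Restates the WHERE clause of the phone_lookup query in Java (illustrative sketch).
final class PhoneLookupMatchSketch {

    /**
     * @param stored       normalized_number stored in phone_lookup
     * @param incoming     normalized incoming number (11 digits in the article's SQL)
     * @param incomingE164 E.164 form of the incoming number, may be null
     */
    static boolean matches(String stored, String incoming, String incomingE164) {
        // Branch 1: lookup.normalized_number = '+86...'
        if (stored.equals(incomingE164)) {
            return true;
        }
        int n = incoming.length();
        int len = stored.length();
        if (len <= n) {
            // Branch 2: substr(incoming, n - len + 1) = lookup.normalized_number
            return incoming.substring(n - len).equals(stored);
        }
        // Branch 3: substr(lookup.normalized_number, len + 1 - n) = incoming
        return stored.substring(len - n).equals(incoming);
    }

    public static void main(String[] args) {
        String incoming = "13912345678";
        String e164 = "+8613912345678";
        System.out.println(matches("+8613912345678", incoming, e164)); // true
        System.out.println(matches("008613912345678", incoming, e164)); // true
        System.out.println(matches("12345678", incoming, e164));        // true
        System.out.println(matches("13912345679", incoming, e164));     // false
    }
}
```

Read this way, the query tolerates a stored number that carries a country or trunk prefix the incoming number lacks, and the reverse case where the stored number is only the tail of the incoming one.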

WHERE (lookup.data_id = data._id AND 
        data.raw_contact_id = raw_contacts._id
        ……

Now look at the indexes on the data table and the phone_lookup table:

Indexes on the data table:

Name                                   Columns
data_raw_contact_id                    raw_contact_id
data_mimetype_data1_index              mimetype_id, data1
data_hash_id_index                     hash_id

Indexes on the phone_lookup table:

Name                                   Columns
phone_lookup_index                     normalized_number, raw_contact_id, data_id
phone_lookup_min_match_index           min_match, raw_contact_id, data_id
phone_lookup_data_id_min_match_index   data_id, min_match

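Both the first condition, lookup.data_id = data._id, and the second, data.raw_contact_id = raw_contacts._id, can use an index to speed up the query. The phone_lookup table has three composite indexes, which between them cover essentially every column of the table. For background on how indexes improve query performance, see the articles listed in the references at the end of this post.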
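To illustrate why a composite index such as phone_lookup_min_match_index helps, the sketch below models it as a sorted set of (min_match, raw_contact_id, data_id) entries and answers "WHERE min_match = ?" with a range scan instead of a full pass over the rows. This is an analogy in plain Java, not how SQLite stores its B-trees; MinMatchIndexSketch and its members are hypothetical names.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

// Models a composite index (min_match, raw_contact_id, data_id) as a sorted set.
final class MinMatchIndexSketch {

    static final class Entry implements Comparable<Entry> {
        final String minMatch;
        final long rawContactId;
        final long dataId;

        Entry(String minMatch, long rawContactId, long dataId) {
            this.minMatch = minMatch;
            this.rawContactId = rawContactId;
            this.dataId = dataId;
        }

        @Override
        public int compareTo(Entry o) {
            // Compare column by column, in index order.
            int c = minMatch.compareTo(o.minMatch);
            if (c != 0) return c;
            c = Long.compare(rawContactId, o.rawContactId);
            if (c != 0) return c;
            return Long.compare(dataId, o.dataId);
        }
    }

    private final TreeSet<Entry> index = new TreeSet<>();

    void insert(String minMatch, long rawContactId, long dataId) {
        index.add(new Entry(minMatch, rawContactId, dataId));
    }

    /** Range scan over all entries whose first column equals minMatch. */
    List<Long> dataIdsFor(String minMatch) {
        Entry lo = new Entry(minMatch, Long.MIN_VALUE, Long.MIN_VALUE);
        Entry hi = new Entry(minMatch, Long.MAX_VALUE, Long.MAX_VALUE);
        List<Long> result = new ArrayList<>();
        for (Entry e : index.subSet(lo, true, hi, true)) {
            result.add(e.dataId); // data_id comes straight from the index entry
        }
        return result;
    }

    public static void main(String[] args) {
        MinMatchIndexSketch idx = new MinMatchIndexSketch();
        idx.insert("8765432", 1, 10);
        idx.insert("8765432", 2, 20);
        idx.insert("1111111", 3, 30);
        System.out.println(idx.dataIdsFor("8765432")); // [10, 20]
    }
}
```

Because data_id is itself part of the index entry, the lookup never has to visit the underlying rows, which is the same idea as a covering index in a real database.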
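One thing is certain: phone_lookup relies on its indexes to speed up queries, and it exists so that a contact's name can be shown quickly when a call comes in. With only a few contacts the gain may be barely noticeable; the optimization pays off once the number of contacts reaches the hundreds of thousands. So when Google's developer documentation for PhoneLookup says
“A table that represents the result of looking up a phone number, for example for caller ID. To perform a lookup you must append the number you want to find to CONTENT_FILTER_URI. This query is highly optimized.”, the "highly optimized" refers to optimization at the database level rather than at the code level.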
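Incidentally, storing the number in reverse in the min_match column is also meant to speed up lookups. Imagine several hundred thousand stored contacts whose numbers all begin with 13, 15, or 17: reversing the number and indexing that column is a reasonable way to make the query faster.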
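A reversed key of this kind could be computed roughly as follows. This is a sketch under assumptions: the helper name minMatchKey is hypothetical, and the choice to keep only the last few digits before reversing (7 in the usage example) is an assumption of this sketch, not something stated in the article. Pass the full length to reverse the whole number.

```java
// Illustrative sketch of building a reversed lookup key; not the AOSP code.
final class MinMatchKeySketch {

    /**
     * Reverses the last {@code keep} digits of a normalized number.
     * Non-digit characters such as a leading '+' are skipped.
     */
    static String minMatchKey(String normalized, int keep) {
        StringBuilder digits = new StringBuilder();
        for (int i = 0; i < normalized.length(); i++) {
            char c = normalized.charAt(i);
            if (c >= '0' && c <= '9') {
                digits.append(c);
            }
        }
        int start = Math.max(0, digits.length() - keep);
        return new StringBuilder(digits.substring(start)).reverse().toString();
    }

    public static void main(String[] args) {
        // Assumption: keep the last 7 digits.
        System.out.println(minMatchKey("+8613912345678", 7)); // 8765432
        System.out.println(minMatchKey("13912345678", 7));    // 8765432
    }
}
```

Both spellings of the same number produce the same key, so a single equality test on an indexed column finds the candidate rows regardless of country prefix, and keys built from the tail of the number differ early instead of sharing a long common prefix.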

Reference: https://developer.android.com/reference/android/provider/ContactsContract.PhoneLookup
Reference: https://www.cnblogs.com/hyd1213126/p/5828937.html
Reference: https://www.cnblogs.com/wuchanming/p/6886020.html
Reference: https://www.cnblogs.com/aspwebchh/p/6652855.html
