{"id":727,"date":"2020-09-22T02:58:15","date_gmt":"2020-09-22T02:58:15","guid":{"rendered":"https:\/\/support.divominer.cn\/en\/knowledge-base\/how-to-build-a-sampling-library\/"},"modified":"2021-09-01T06:29:59","modified_gmt":"2021-09-01T06:29:59","slug":"how-to-build-a-sampling-library","status":"publish","type":"ht_kb","link":"https:\/\/support.divominer.cn\/en\/knowledge-base\/how-to-build-a-sampling-library\/","title":{"rendered":"How to create a Sample Database?"},"content":{"rendered":"\n<p><span lang=\"EN-US\">DiVoMiner\u00ae provides a fast and convenient method for data sampling. A portion of the overall data is sampled to form a Sample Database, which is independent of the \u201cCoding Pool\u201d. Machine coding, manual coding, statistical analysis, and visualization can all be performed independently on the data in the Sample Database. Multiple coding databases can operate in parallel.<\/span><\/p>\n\n\n\n<p><span lang=\"EN-US\">There are two ways to create a sample database. The first is to sample from the [Overview] page under [Data Management]. The second is to sample from the [Coding Pool]. 
The specific steps are as follows:<\/span><\/p>\n\n\n\n<p><span lang=\"EN-US\">Method 1: Sampling from the [Overview] page<\/span><\/p>\n\n\n\n<p><span lang=\"EN-US\">Go to [Data Management]-[Overview], select a database, such as [News Database], and click [Sampling].<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"666\" height=\"198\" src=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/46\u62bd\u6837-1.png\" alt=\"\" class=\"wp-image-8565\" srcset=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/46\u62bd\u6837-1.png 666w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/46\u62bd\u6837-1-300x89.png 300w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/46\u62bd\u6837-1-50x15.png 50w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/46\u62bd\u6837-1-600x178.png 600w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/46\u62bd\u6837-1-320x95.png 320w\" sizes=\"(max-width: 666px) 100vw, 666px\" \/><\/figure>\n\n\n\n<p><span lang=\"EN-US\">Enter a name to create a new sample database, or select an existing sample database (in which case the new samples are added to it). 
Click [Next].<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"707\" height=\"213\" src=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/48\u62bd\u6837-1.png\" alt=\"\" class=\"wp-image-8566\" srcset=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/48\u62bd\u6837-1.png 707w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/48\u62bd\u6837-1-300x90.png 300w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/48\u62bd\u6837-1-50x15.png 50w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/48\u62bd\u6837-1-600x181.png 600w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/48\u62bd\u6837-1-320x96.png 320w\" sizes=\"(max-width: 707px) 100vw, 707px\" \/><\/figure>\n\n\n\n<p><span lang=\"EN-US\">The sampling method can be set to random sampling, or to sampling by a chosen criterion (e.g. coding time) with the samples arranged in ascending or descending order.<\/span><\/p>\n\n\n\n<p><span lang=\"EN-US\">Fill in the sampling size. 
The sampling size can be set as a specific number or a percentage of the overall data.<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"711\" height=\"269\" src=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/49\u62bd\u6837-1.png\" alt=\"\" class=\"wp-image-8567\" srcset=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/49\u62bd\u6837-1.png 711w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/49\u62bd\u6837-1-300x114.png 300w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/49\u62bd\u6837-1-50x19.png 50w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/49\u62bd\u6837-1-600x227.png 600w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/49\u62bd\u6837-1-320x121.png 320w\" sizes=\"(max-width: 711px) 100vw, 711px\" \/><\/figure>\n\n\n\n<p><span lang=\"EN-US\">Set the sampling range to restrict sampling to a specific subset of the data. Sampling criteria can be added to narrow the range further.<\/span><\/p>\n\n\n\n<p><span lang=\"EN-US\">Sampling criteria:<\/span><\/p>\n\n\n\n<ul><li>[All criteria] means that all of the set conditions must be met at the same time; the relationship between the conditions is \u201cAND\u201d;<\/li><li>[Any criteria] means that meeting any one of the set conditions is sufficient; the relationship between the conditions is \u201cOR\u201d.<\/li><\/ul>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"640\" height=\"247\" src=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/50\u62bd\u6837-1.png\" alt=\"\" class=\"wp-image-8568\" srcset=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/50\u62bd\u6837-1.png 640w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/50\u62bd\u6837-1-300x116.png 300w, 
https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/50\u62bd\u6837-1-50x19.png 50w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/50\u62bd\u6837-1-600x232.png 600w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/50\u62bd\u6837-1-320x124.png 320w\" sizes=\"(max-width: 640px) 100vw, 640px\" \/><\/figure>\n\n\n\n<p><span lang=\"EN-US\">Click [OK] to complete the sampling range setting.<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"712\" height=\"324\" src=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/51\u62bd\u6837-1.png\" alt=\"\" class=\"wp-image-8569\" srcset=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/51\u62bd\u6837-1.png 712w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/51\u62bd\u6837-1-300x137.png 300w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/51\u62bd\u6837-1-50x23.png 50w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/51\u62bd\u6837-1-600x273.png 600w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/51\u62bd\u6837-1-320x146.png 320w\" sizes=\"(max-width: 712px) 100vw, 712px\" \/><\/figure>\n\n\n\n<p><span lang=\"EN-US\">The sample database is displayed in the database list under the [Data Management]-[Overview] section, and operations can be performed on it directly and independently.<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"665\" height=\"195\" src=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/52\u62bd\u6837-1.png\" alt=\"\" class=\"wp-image-8570\" srcset=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/52\u62bd\u6837-1.png 665w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/52\u62bd\u6837-1-300x88.png 300w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/52\u62bd\u6837-1-50x15.png 
50w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/52\u62bd\u6837-1-600x176.png 600w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/52\u62bd\u6837-1-320x94.png 320w\" sizes=\"(max-width: 665px) 100vw, 665px\" \/><\/figure>\n\n\n\n<p><span lang=\"EN-US\">Note: The data in the sample database is independent of other databases. The sampled data counts toward data capacity and file capacity, so the total counts of the Coding Pool increase accordingly after sampling.<\/span><\/p>\n\n\n\n<p><span lang=\"EN-US\">Method 2: Sampling from the [Coding Pool]<\/span><\/p>\n\n\n\n<p><span lang=\"EN-US\">In the [Coding Pool], select a database such as [News Database], and click [Sampling]. The sampling settings are the same as on the [Overview] page: open the sampling settings, name the sample database, select the sampling method, and set the sampling range to complete the sampling.<\/span><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"1024\" height=\"336\" src=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/15\u62bd\u6837-1024x336.png\" alt=\"\" class=\"wp-image-8572\" srcset=\"https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/15\u62bd\u6837-1024x336.png 1024w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/15\u62bd\u6837-300x98.png 300w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/15\u62bd\u6837-768x252.png 768w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/15\u62bd\u6837-50x16.png 50w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/15\u62bd\u6837-1536x504.png 1536w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/15\u62bd\u6837-920x302.png 920w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/15\u62bd\u6837-600x197.png 600w, https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/15\u62bd\u6837-320x105.png 320w, 
https:\/\/support.divominer.cn\/en\/wp-content\/uploads\/2021\/09\/15\u62bd\u6837.png 1563w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n","protected":false},"author":2,"comment_status":"open","ping_status":"closed","template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0},"ht_kb_category":[5],"ht_kb_tag":[24],"_links":{"self":[{"href":"https:\/\/support.divominer.cn\/en\/wp-json\/wp\/v2\/ht_kb\/727"}],"collection":[{"href":"https:\/\/support.divominer.cn\/en\/wp-json\/wp\/v2\/ht_kb"}],"about":[{"href":"https:\/\/support.divominer.cn\/en\/wp-json\/wp\/v2\/types\/ht_kb"}],"author":[{"embeddable":true,"href":"https:\/\/support.divominer.cn\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/support.divominer.cn\/en\/wp-json\/wp\/v2\/comments?post=727"}],"version-history":[{"count":93,"href":"https:\/\/support.divominer.cn\/en\/wp-json\/wp\/v2\/ht_kb\/727\/revisions"}],"predecessor-version":[{"id":8573,"href":"https:\/\/support.divominer.cn\/en\/wp-json\/wp\/v2\/ht_kb\/727\/revisions\/8573"}],"wp:attachment":[{"href":"https:\/\/support.divominer.cn\/en\/wp-json\/wp\/v2\/media?parent=727"}],"wp:term":[{"taxonomy":"ht_kb_category","embeddable":true,"href":"https:\/\/support.divominer.cn\/en\/wp-json\/wp\/v2\/ht_kb_category?post=727"},{"taxonomy":"ht_kb_tag","embeddable":true,"href":"https:\/\/support.divominer.cn\/en\/wp-json\/wp\/v2\/ht_kb_tag?post=727"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}