thwiki dump progress on 20200801
This is the Wikimedia dump service.
Please read the copyrights information.
See Meta:Data dumps
for documentation on the provided data formats.
The 7zip decoder on Windows is known to have
problems with some bz2-format
files for larger wikis; we recommend the use of bzip2 for Windows for these cases.
Please report problems with these dumps on Phabricator and add the
Dumps-generation tag.
See all databases list.
Last dumped on 2020-07-20
For a machine-readable version of the information on this page,
see the json status file.
Dump complete
Verify downloaded files against the (md5), (sha1) checksums
to check for corrupted files.
- 2020-08-03 09:28:09 done Articles, templates, media/file descriptions, and primary meta-pages, in multiple bz2 streams, 100 pages per stream
- 2020-08-07 07:02:33 done All pages with complete edit history (.7z)
- 2020-08-07 04:26:49 done All pages with complete page edit history (.bz2)
b'2020-08-07 04:26:12: thwiki (ID 37739) 856927 pages (8.3|186004.7/sec all|curr), 8126524 revs (78.5|113.7/sec all|curr), 99.3%|99.4% prefetched (all|curr), ETA 2020-08-07 07:32:28 [max 9003583]'
- 2020-08-05 23:40:00 done Log events to all pages and users.
- 2020-08-04 22:27:13 done All pages, current versions only.
- 2020-08-02 20:57:44 done Articles, templates, media/file descriptions, and primary meta-pages.
- 2020-08-01 16:41:46 done First-pass for page XML data dumps
- 2020-08-05 23:32:02 done Extracted page abstracts for Yahoo
b'2020-08-05 23:31:56: thwiki (ID 33769) 1970 pages (25.5|51.3/sec all|curr), 1970 revs (25.5|25.2/sec all|curr), ETA 2020-08-06 11:41:09 [max 1117250]'
- 2020-08-05 20:37:11 done List of all page titles
- 2020-08-05 20:36:53 done List of page titles in main namespace
- 2020-08-05 20:36:41 done Namespaces, namespace aliases, magic words.
- 2020-08-02 00:55:52 done Category information.
- 2020-08-02 01:03:20 done Wiki template inclusion link records.
- 2020-08-02 01:01:09 done This contains the SiteMatrix information from meta.wikimedia.org provided as a table.
- 2020-08-02 01:01:23 done Interwiki link tracking records
- 2020-08-02 01:00:07 done Newer per-page restrictions table.
- 2020-08-02 01:00:20 done User group assignments.
- 2020-08-02 00:57:37 done A few statistics such as the page count.
- 2020-08-02 01:02:06 done Annotation (tag) names and ids.
- 2020-08-02 00:57:49 done Language proficiency information per user.
- 2020-08-02 00:59:21 done Wiki page-to-page link records.
- 2020-08-02 01:02:46 done List of annotations (tags) for revisions and log entries
- 2020-08-02 01:01:51 done Base per-page data (id, title, old restrictions, etc).
- 2020-08-02 00:56:47 done Wiki interlanguage link records.
- 2020-08-02 00:55:19 done Redirect list
- 2020-08-02 00:57:21 done Nonexistent pages that have been protected.
- 2020-08-02 01:00:34 done List of pages' geographical coordinates
- 2020-08-02 01:03:38 done Wiki media/files usage records.
- 2020-08-02 00:54:49 done Past user group assignments.
- 2020-08-02 01:02:21 done Tracks which pages use which Wikidata items or properties and what aspect (e.g. item label) is used.
- 2020-08-02 00:59:53 done Wiki external URL link records.
- 2020-08-02 01:00:51 done Metadata on current versions of uploaded media/files.
- 2020-08-02 01:04:07 done Wiki category membership link records.
- 2020-08-02 00:58:03 done Name/value pairs for pages.