Byte-pair encoding

WikiRank.net
ver. 1.6.2

Byte-pair encoding is a data compression technique in which the most common pair of consecutive bytes is replaced with a byte that does not occur within the data. The article “Byte-pair encoding” in English Wikipedia has a quality score of 30.7 points (as of July 1, 2025). The article contains 16 references and 7 sections.
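
The replacement scheme in the definition above can be sketched in a few lines of Python. This is only an illustrative sketch, not a reference implementation: the function names `bpe_compress` and `bpe_decompress` and the rule-list format are assumptions made for this example, and pair counts include overlapping occurrences for simplicity.

```python
from collections import Counter


def bpe_compress(data):
    """Repeatedly replace the most common adjacent byte pair with an unused byte.

    Returns the compressed bytes and the list of replacement rules
    (new_byte, (left, right)) in the order they were applied.
    """
    seq = list(data)
    rules = []
    while True:
        # Byte values that do not occur in the current sequence.
        unused = sorted(set(range(256)) - set(seq))
        # Count adjacent pairs (overlapping occurrences are counted too).
        pairs = Counter(zip(seq, seq[1:]))
        if not unused or not pairs:
            break
        pair, count = pairs.most_common(1)[0]
        if count < 2:
            break  # no pair repeats, so a replacement would not help
        new_byte = unused[0]
        out, i = [], 0
        while i < len(seq):
            # Scan left to right, replacing non-overlapping occurrences.
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
                out.append(new_byte)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
        rules.append((new_byte, pair))
    return bytes(seq), rules


def bpe_decompress(data, rules):
    """Undo the replacements in reverse order to recover the original bytes."""
    seq = list(data)
    for new_byte, (left, right) in reversed(rules):
        out = []
        for b in seq:
            if b == new_byte:
                out.extend((left, right))
            else:
                out.append(b)
        seq = out
    return bytes(seq)
```

Calling `bpe_compress(b"aaabdaaabac")` returns a shorter byte string together with the rules needed to reverse it, and `bpe_decompress` applied to that result gives back the original input. Storing the rules alongside the output is what makes the compression reversible.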

The Catalan Wikipedia version of this article has the highest quality score, while the English version is the most popular.

Since the article “Byte-pair encoding” was created, its content has been written by 47 registered users of English Wikipedia and edited by 87 registered Wikipedia users across all languages.

The article is cited 164 times in English Wikipedia and cited 354 times in all languages.

The highest Authors’ Interest rank since 2001:

  • Local (English): #22749 in May 2025
  • Global: #44420 in May 2025

The highest popularity rank since 2008:

  • Local (English): #89442 in March 2025
  • Global: #159263 in March 2025

There are 10 language versions of this article in the WikiRank database (out of the 55 Wikipedia language editions considered).

The quality and popularity assessment was based on Wikipedia dumps from July 1, 2025 (including revision history and pageviews for previous years).

The table below shows the language versions of the article with the highest quality.

Languages with the highest quality

# | Language | Article title | Quality score
1 | Catalan (ca) | Codificació de parells de bytes | 33.7639
2 | Persian (fa) | کدگذاری جفت بایت | 30.8149
3 | English (en) | Byte-pair encoding | 30.6706
4 | German (de) | Byte Pair Encoding | 27.8718
5 | Korean (ko) | 바이트 페어 인코딩 | 23.0046
6 | French (fr) | Byte pair encoding | 20.2456
7 | Spanish (es) | Codificación de pares de bytes | 19.285
8 | Arabic (ar) | ترميز زوج البايتات | 15.3802
9 | Chinese (zh) | 字节对编码 | 2.0407
10 | Japanese (ja) | バイト対符号化 | 0.9264

The following table shows the most popular language versions of the article.

Most popular of all time

The most popular language versions of the article "Byte-pair encoding" of all time
# | Language | Article title | Relative popularity
1 | English (en) | Byte-pair encoding | 510 107
2 | Japanese (ja) | バイト対符号化 | 11 270
3 | Chinese (zh) | 字节对编码 | 9 889
4 | Arabic (ar) | ترميز زوج البايتات | 4 174
5 | Spanish (es) | Codificación de pares de bytes | 1 803
6 | German (de) | Byte Pair Encoding | 418
7 | French (fr) | Byte pair encoding | 272
8 | Korean (ko) | 바이트 페어 인코딩 | 207
9 | Persian (fa) | کدگذاری جفت بایت | 73
10 | Catalan (ca) | Codificació de parells de bytes | 58

The following table shows the language versions of the article with the highest popularity in the last month.

Most popular in June 2025

The most popular language versions of the article "Byte-pair encoding" in June 2025
# | Language | Article title | Relative popularity
1 | English (en) | Byte-pair encoding | 9 023
2 | Chinese (zh) | 字节对编码 | 135
3 | French (fr) | Byte pair encoding | 122
4 | Japanese (ja) | バイト対符号化 | 108
5 | German (de) | Byte Pair Encoding | 102
6 | Spanish (es) | Codificación de pares de bytes | 59
7 | Arabic (ar) | ترميز زوج البايتات | 30
8 | Korean (ko) | 바이트 페어 인코딩 | 30
9 | Persian (fa) | کدگذاری جفت بایت | 22
10 | Catalan (ca) | Codificació de parells de bytes | 18

The following table shows the language versions of the article with the highest Authors’ Interest.

The highest AI

Language versions of the article "Byte-pair encoding" with the highest Authors Interest (number of authors). Only registered Wikipedia users were taken into account.
# | Language | Article title | Relative AI
1 | English (en) | Byte-pair encoding | 47
2 | Arabic (ar) | ترميز زوج البايتات | 11
3 | German (de) | Byte Pair Encoding | 8
4 | Japanese (ja) | バイト対符号化 | 6
5 | Chinese (zh) | 字节对编码 | 5
6 | Spanish (es) | Codificación de pares de bytes | 3
7 | French (fr) | Byte pair encoding | 3
8 | Persian (fa) | کدگذاری جفت بایت | 2
9 | Catalan (ca) | Codificació de parells de bytes | 1
10 | Korean (ko) | 바이트 페어 인코딩 | 1

The following table shows the language versions of the article with the highest Authors’ Interest in the last month.

The highest AI in June 2025

Language versions of the article "Byte-pair encoding" with the highest AI in June 2025
# | Language | Article title | Relative AI
1 | French (fr) | Byte pair encoding | 1
2 | Arabic (ar) | ترميز زوج البايتات | 0
3 | Catalan (ca) | Codificació de parells de bytes | 0
4 | German (de) | Byte Pair Encoding | 0
5 | English (en) | Byte-pair encoding | 0
6 | Spanish (es) | Codificación de pares de bytes | 0
7 | Persian (fa) | کدگذاری جفت بایت | 0
8 | Japanese (ja) | バイト対符号化 | 0
9 | Korean (ko) | 바이트 페어 인코딩 | 0
10 | Chinese (zh) | 字节对编码 | 0

The following table shows the language versions of the article with the highest number of citations.

The highest CI

Language versions of the article "Byte-pair encoding" with the highest Citation Index (CI)
# | Language | Article title | Relative CI
1 | English (en) | Byte-pair encoding | 164
2 | Japanese (ja) | バイト対符号化 | 84
3 | Chinese (zh) | 字节对编码 | 57
4 | Arabic (ar) | ترميز زوج البايتات | 42
5 | Spanish (es) | Codificación de pares de bytes | 3
6 | French (fr) | Byte pair encoding | 2
7 | German (de) | Byte Pair Encoding | 1
8 | Persian (fa) | کدگذاری جفت بایت | 1
9 | Catalan (ca) | Codificació de parells de bytes | 0
10 | Korean (ko) | 바이트 페어 인코딩 | 0

Interwikis

Code | Language | Article title
ar | Arabic | ترميز زوج البايتات
ca | Catalan | Codificació de parells de bytes
de | German | Byte Pair Encoding
en | English | Byte-pair encoding
es | Spanish | Codificación de pares de bytes
fa | Persian | کدگذاری جفت بایت
fr | French | Byte pair encoding
ja | Japanese | バイト対符号化
ko | Korean | 바이트 페어 인코딩
zh | Chinese | 字节对编码

Popularity rank trends

Best rank, English: #89442 (03.2025)
Best rank, global: #159263 (03.2025)

AI rank trends

Best rank, English: #22749 (05.2025)
Best rank, global: #44420 (05.2025)

Important global interconnections (July 2024 – June 2025)

Wikipedia readers most often reach the article on Byte-pair encoding from the Wikipedia articles on Large language model, Transformer, QR code, Bidirectional encoder representations from transformers and BPE. From the article on Byte-pair encoding, readers most often go on to the Wikipedia articles on Sequitur algorithm, Re-Pair, N-gram, Large language model and Lookup table.

About WikiRank

The WikiRank project is intended for automatic relative evaluation of articles in the various language versions of Wikipedia. At the moment the service allows comparing over 44 million Wikipedia articles in 55 languages. Quality scores of articles are based on Wikipedia dumps from July 2025. Current popularity and AI values take data from June 2025 into account. For historical values of popularity and AI, WikiRank used data from 2001 to 2025...