Training a Tokenizer for BERT Models

This article is divided into two parts; they are:

• Picking a Dataset
• Training a Tokenizer

To keep things simple, we’ll use English text only.
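The excerpt does not include the article's own code, so the following is only a minimal sketch of what the "Training a Tokenizer" step could look like. BERT uses WordPiece tokenization, and this toy trainer illustrates the idea with the standard library alone: start from single characters, then repeatedly merge the adjacent pair with the highest score (pair frequency divided by the product of the two piece frequencies). The function name `train_wordpiece`, the tiny in-memory corpus, and the `vocab_size` value are all hypothetical choices made for illustration; a real run would use a proper dataset and, most likely, a dedicated tokenizer library.

```python
from collections import Counter

SPECIAL_TOKENS = ["[PAD]", "[UNK]", "[CLS]", "[SEP]", "[MASK]"]

def train_wordpiece(corpus, vocab_size):
    """Toy WordPiece-style trainer (illustrative only, not production code)."""
    # Count lowercased, whitespace-split words; English text only.
    word_freqs = Counter(w for line in corpus for w in line.lower().split())
    # Split each word into characters; non-initial pieces carry a "##" prefix.
    splits = {w: [w[0]] + ["##" + c for c in w[1:]] for w in word_freqs}
    vocab = SPECIAL_TOKENS + sorted({p for ps in splits.values() for p in ps})

    while len(vocab) < vocab_size:
        piece_freqs, pair_freqs = Counter(), Counter()
        for word, freq in word_freqs.items():
            pieces = splits[word]
            for piece in pieces:
                piece_freqs[piece] += freq
            for pair in zip(pieces, pieces[1:]):
                pair_freqs[pair] += freq
        if not pair_freqs:
            break  # every word is already a single piece

        # WordPiece score: pair frequency normalised by its parts' frequencies.
        # Sorting first makes tie-breaking deterministic.
        def score(pair):
            return pair_freqs[pair] / (piece_freqs[pair[0]] * piece_freqs[pair[1]])
        a, b = max(sorted(pair_freqs), key=score)
        merged = a + b[2:]  # b is never word-initial, so it starts with "##"

        # Apply the merge to every word that contains the pair.
        for word, pieces in splits.items():
            out, i = [], 0
            while i < len(pieces):
                if i + 1 < len(pieces) and pieces[i] == a and pieces[i + 1] == b:
                    out.append(merged)
                    i += 2
                else:
                    out.append(pieces[i])
                    i += 1
            splits[word] = out
        if merged not in vocab:
            vocab.append(merged)
    return vocab

# Hypothetical two-line corpus standing in for a real English dataset.
corpus = ["the cat sat on the mat", "the dog sat on the log"]
vocab = train_wordpiece(corpus, vocab_size=30)
print(vocab)
```

The special tokens are placed first so that their ids stay stable regardless of the corpus. The quadratic rescoring on every merge is fine for a demonstration but would be far too slow for a real dataset.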
