Clarified 550 Million token corpus size for CPT methodology verification
README.md
CHANGED
```diff
@@ -42,9 +42,10 @@ Chat2Find-CPT is a specialized version of the Qwen 3.5 4B model, enhanced via **
 - **Batch Size:** 2 (local) / 8 (global with Gradient Accumulation)
 
 ### Dataset
-The model
-- **
-- **
+The model underwent true Continued Pre-Training on a massive 1.38 GB unstructured text corpus. The data was densely packed into:
+- **Size:** ~270,000 packed sequences of 2048 tokens each (**~550 Million total tokens**).
+- **Epochs:** 1 Epoch (Standard pre-training practice to prevent overfitting).
+- **Content:** Sri Lankan News & Media, Cultural Context, and domain-specific raw web data.
 
 ## Capabilities
```
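
For reviewers verifying the new numbers, here is a minimal, self-contained Python sketch. It is not code from the Chat2Find-CPT repository; the `pack()` helper and all variable names are illustrative assumptions. It reproduces the quoted figures: 270,000 packed sequences of 2048 tokens come to 552,960,000 tokens, rounded in the README to "~550 Million", and a global batch of 8 built from a local batch of 2 implies 4 gradient-accumulation steps.

```python
# Sanity check of the corpus and batch figures quoted in the diff above.
# Illustrative only: pack() and these variable names are assumptions,
# not code from the Chat2Find-CPT repository.

def pack(token_ids, seq_len=2048):
    """Densely pack a flat stream of token ids into fixed-length
    sequences, dropping the trailing partial chunk (standard CPT packing)."""
    return [token_ids[i:i + seq_len]
            for i in range(0, len(token_ids) - seq_len + 1, seq_len)]

num_sequences = 270_000             # "~270,000 packed sequences"
seq_len = 2048                      # tokens per packed sequence
total_tokens = num_sequences * seq_len
print(f"{total_tokens:,} tokens")   # 552,960,000 -> "~550 Million"

local_batch, global_batch = 2, 8    # "Batch Size: 2 (local) / 8 (global)"
accum_steps = global_batch // local_batch
print(f"{accum_steps} gradient accumulation steps")  # 4

# Smoke test for pack(): 5000 dummy ids yield 2 full 2048-token sequences.
assert len(pack(list(range(5000)))) == 2
```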