inflaton commited on
Commit
8f3ae45
1 Parent(s): 2238e7c

phi-3.5 rpp results

Browse files
logs/l40-1gpu-5.txt ADDED
The diff for this file is too large to render. See raw diff
 
logs/l40-4gpu-8.txt CHANGED
@@ -175,3 +175,12 @@ You seem to be using the pipelines sequentially on GPU. In order to maximize eff
175
  The `seen_tokens` attribute is deprecated and will be removed in v4.41. Use the `cache_position` model input instead.
176
  2024-08-26 14:30:49,269 [WARNING] [logging.py:328] You are not running the flash-attention implementation, expect numerical differences.
177
  You seem to be using the pipelines sequentially on GPU. In order to maximize efficiency please use a dataset
 
 
 
 
 
 
 
 
 
 
175
  The `seen_tokens` attribute is deprecated and will be removed in v4.41. Use the `cache_position` model input instead.
176
  2024-08-26 14:30:49,269 [WARNING] [logging.py:328] You are not running the flash-attention implementation, expect numerical differences.
177
  You seem to be using the pipelines sequentially on GPU. In order to maximize efficiency please use a dataset
178
+ [nltk_data] Downloading package punkt to
179
+ [nltk_data] /common/home/users/d/dh.huang.2023/nltk_data...
180
+ [nltk_data] Package punkt is already up-to-date!
181
+ 2024-08-26 19:44:35,757 [WARNING] [modeling_phi3.py:62] `flash-attention` package not found, consider installing for better performance: No module named 'flash_attn'.
182
+ 2024-08-26 19:44:35,757 [WARNING] [modeling_phi3.py:66] Current `flash-attention` does not support `window_size`. Either upgrade or use `attn_implementation='eager'`.
183
+
184
+ The `seen_tokens` attribute is deprecated and will be removed in v4.41. Use the `cache_position` model input instead.
185
+ 2024-08-26 19:45:11,738 [WARNING] [logging.py:328] You are not running the flash-attention implementation, expect numerical differences.
186
+ You seem to be using the pipelines sequentially on GPU. In order to maximize efficiency please use a dataset
results/mac-results_rpp_with_mnt_2048.csv CHANGED
The diff for this file is too large to render. See raw diff