MythicalCow
commited on
readme update
Browse files
README.md
CHANGED
@@ -93,6 +93,21 @@ widget:
|
|
93 |
|
94 |
'
|
95 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
96 |
|
97 |
# SentenceTransformer based on Snowflake/snowflake-arctic-embed-l
|
98 |
|
@@ -3614,4 +3629,8 @@ You can finetune this model on your own dataset.
|
|
3614 |
## Model Card Contact
|
3615 |
|
3616 |
*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
|
3617 |
-
-->
|
|
|
|
|
|
|
|
|
|
93 |
|
94 |
'
|
95 |
---
|
96 |
+
# Github and Technical Report
|
97 |
+
To view our technical report and for access to other model and dataset related scripts visit our github [Github](https://github.com/khoj-ai/timely/tree/main)
|
98 |
+
|
99 |
+
# Timely: An Embeddings Model For Temporal Reasoning
|
100 |
+
|
101 |
+
At Khoj, we develop open-source personal AI to simplify how people engage with machines. The RAG component in modern AI systems commonly uses an embedding model to retrieve relevant documents for a user query. This retrieved-context enables accurate and personalized responses.
|
102 |
+
|
103 |
+
However, most of these models struggle with temporal reasoning. For instance, if asked "Where was I last summer?", the model would struggle to understand the framing of that question. It requires us to understand the relativity of time (that 2010 is before 2011), and when summer might be (between May - September).
|
104 |
+
|
105 |
+
When we express dates, we often use shorthands like ‘back in June’, ‘on summer break’, and ‘06/15’; all syntaxes that models don’t presently handle well. As such, your embedding model may not find documents with dates within that specific period. This limitation is significant, given the importance of time and date in language and daily life.
|
106 |
+
|
107 |
+
To address this problem, we propose **Timely**, a comprehensive pipeline for date-aware dataset generation, model fine-tuning, and benchmarking. Specifically, our goal is to create models that can:
|
108 |
+
|
109 |
+
1. Identify natural language dates in queries and documents better
|
110 |
+
2. Can handle relative and soft data filters more naturally (e.g. discerning that June is closer than November when talking about Spring).
|
111 |
|
112 |
# SentenceTransformer based on Snowflake/snowflake-arctic-embed-l
|
113 |
|
|
|
3629 |
## Model Card Contact
|
3630 |
|
3631 |
*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
|
3632 |
+
-->
|
3633 |
+
- loss:MultipleNegativesRankingLoss
|
3634 |
+
datasets:
|
3635 |
+
- sentence-transformers/wikihow
|
3636 |
+
---
|