Spaces: fyang0507/my-notion-companion


fyang0507 committed on Mar 13

Commit 7a6abb6 • 1 Parent(s): b5f7613

add resources


Files changed (8)

resources/embedding_model_scores.csv +11 -0
resources/flowchart.png +0 -0
resources/langsmith_walkthrough.mp4 +3 -0
resources/link-to-flowchart.txt +1 -0
resources/llm_scores.csv +94 -0
resources/search-limit-chinese.png +0 -0
resources/test_llm_standalone_chat.txt +21 -0
resources/vectordatabase_evaluation.csv +6 -0

resources/embedding_model_scores.csv ADDED

	@@ -0,0 +1,11 @@

+Model name,Provider,Model size (#params),Model size (disk),Downloads past month,Highlights,Time Load/Inference (online compute),Mean difference paired & unpaired Q & Docs
+intfloat/multilingual-e5-large,Microsoft,560M,2.2G,93K,24 layers and the embedding size is 1024,5.0s/1920s,0.062
+intfloat/multilingual-e5-base,Microsoft,278M,1.1G,42K,12 layers and the embedding size is 768,3.4s/531s,0.063
+sentence-transformers/LaBSE,Google,,1.9G,88K,the embedding size is 768,5.7s/620s,0.19
+maidalun1020/bce-embedding-base_v1,NetEase-Youdao,279M,1.1G,111K,optimized for RAG,3.0s/495s,0.23
+BAAI/bge-large-zh-v1.5,Beijing Academy of Artificial Intelligence,326M,1.3G,22K,,1.6s/1730s,0.26
+uer/sbert-base-chinese-nli,Tencent,,409M,8K,12 layers and the embedding size is 768,0.6s/1350s,0.22
+sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2,Sentence Transformer,,449M,38K,384 embedding size,1.4s/392s,0.25
+sentence-transformers/distiluse-base-multilingual-cased-v1,Sentence Transformer,,539M,31K,768 embedding size,1.3s/163s,0.28
+sentence-transformers/distiluse-base-multilingual-cased-v2,Sentence Transformer,,539M,43K,768 embedding size,1.2s/164s,0.25
+sentence-transformers/paraphrase-multilingual-mpnet-base-v2,Sentence Transformer,,1.1G,24K,768 embedding size,2.7s/463s,0.21
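The commit does not include the code behind the last column, "Mean difference paired & unpaired Q & Docs". One plausible reading, offered here only as an assumption, is the mean similarity between each question and its matching document minus the mean similarity between questions and non-matching documents. Under that reading a larger gap would mean the model separates relevant from irrelevant documents more clearly. The sketch below uses cosine similarity and toy vectors; the function names and data are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def paired_unpaired_gap(question_vecs, doc_vecs):
    """Mean similarity of matching pairs minus mean similarity of mismatched pairs.

    Assumption: question_vecs[i] belongs with doc_vecs[i].
    """
    paired, unpaired = [], []
    for i, q in enumerate(question_vecs):
        for j, d in enumerate(doc_vecs):
            (paired if i == j else unpaired).append(cosine(q, d))
    return sum(paired) / len(paired) - sum(unpaired) / len(unpaired)

# Toy 2-D vectors standing in for real embeddings (hypothetical data).
questions = [[1.0, 0.0], [0.0, 1.0]]
docs = [[1.0, 0.0], [0.0, 1.0]]
gap = paired_unpaired_gap(questions, docs)
```

With real embeddings, `question_vecs` and `doc_vecs` would come from whichever model in the table is being evaluated.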

resources/flowchart.png ADDED

resources/langsmith_walkthrough.mp4 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:463ca06f3e980b75c325e5cc93656a7cc7885906ce1a1eb95e6e6c230fae5ce2
+size 9444302

resources/link-to-flowchart.txt ADDED

	@@ -0,0 +1 @@


+https://lucid.app/lucidchart/286fb57c-e78a-4723-857b-cbd92252af8a/edit

resources/llm_scores.csv ADDED

	@@ -0,0 +1,94 @@

+,baichuan-inc/Baichuan-7B,hfl/chinese-alpaca-2-7b,Qwen/Qwen-7B-Chat,Qwen/Qwen1.5-7B-Chat,HuggingFaceH4/zephyr-7b-beta,01-ai/Yi-6B-Chat,BAAI/AquilaChat2-7B-16K
+"Overall (lower is better)",17,15,11,10,14,24,Generate nonsense
+Ready to use,"2
+Some special tokens returned but easy to clean up","1
+No special tokens in responses","2
+Some special tokens returned but easy to clean up","1
+Robust even without system prompt","1
+No special tokens in responses","3
+Returning many unrelated texts, indicating post-processing requirements",NA
+Instruction following - general,"2
+Has trouble distinguishing the poem types","2
+Has trouble distinguishing the poem types","1
+Perfectly follows","2
+almost follows except one case with Chinese-only instruction","2
+Perfectly follows if ignoring language requirements on poems","5
+Occasionally not answering questions at all",NA
+Instruction Following - language,"3
+Always answers in Chinese","3
+Always answers in Chinese","1
+Perfectly distinguishing output language requirements","2
+almost follows except one case with Chinese-only instruction","2
+Has trouble citing Chinese poems","1
+Perfectly distinguishing output language requirements",NA
+Helpfulness and Creativeness,"1
+Answer questions with helpful contexts","1
+Answer questions with helpful contexts","2
+Very concise, sometimes too concise","1
+Answer questions with helpful contexts","1
+Answer questions with helpful contexts","3
+Too verbose",NA
+Fact,"2
+Wrong answers for citing poem","2
+Wrong answers for citing poem","1
+No obvious mistakes","1
+No obvious mistakes","2
+Wrong answers for citing poem","3
+Wrong answers for citing poem and country territory",NA
+Reasoning,"2
+Self-consistent in reasoning but factually wrong","2
+Self-consistent in reasoning but factually wrong","2
+Self-consistent in reasoning but factually wrong","1
+Self-consistent and factually correct","2
+Self-consistent in reasoning but factually wrong","2
+Self-consistent in reasoning but factually wrong",NA
+Coding,"3
+Valid code but wrong formatting or explanation","3
+Valid code but wrong formatting or explanation","1
+Perfect codes with explanation","1
+Perfect codes with explanation","1
+Perfect codes with explanation","4
+Nonsense",NA
+Inference Speed,2,1,1,1,3,3,3
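The "Overall" row appears to be the plain sum of the eight per-category scores for each model. The commit does not say so explicitly, but the arithmetic matches. A minimal sketch that re-derives it, with the numeric scores copied from the CSV above and the comments dropped (BAAI/AquilaChat2-7B-16K is left out because most of its cells are NA):

```python
# Scores per model, in row order: Ready to use, Instruction following (general),
# Instruction following (language), Helpfulness and Creativeness, Fact,
# Reasoning, Coding, Inference Speed.
scores = {
    "baichuan-inc/Baichuan-7B": [2, 2, 3, 1, 2, 2, 3, 2],
    "hfl/chinese-alpaca-2-7b": [1, 2, 3, 1, 2, 2, 3, 1],
    "Qwen/Qwen-7B-Chat": [2, 1, 1, 2, 1, 2, 1, 1],
    "Qwen/Qwen1.5-7B-Chat": [1, 2, 2, 1, 1, 1, 1, 1],
    "HuggingFaceH4/zephyr-7b-beta": [1, 2, 2, 1, 2, 2, 1, 3],
    "01-ai/Yi-6B-Chat": [3, 5, 1, 3, 3, 2, 4, 3],
}

# Sum the category scores; a lower total means a better model.
overall = {model: sum(values) for model, values in scores.items()}
ranking = sorted(overall, key=overall.get)

for model in ranking:
    print(f"{overall[model]:>3}  {model}")
```

The totals reproduce the "Overall" row (17, 15, 11, 10, 14, 24), with Qwen/Qwen1.5-7B-Chat ranked first.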

resources/search-limit-chinese.png ADDED

resources/test_llm_standalone_chat.txt ADDED

	@@ -0,0 +1,21 @@

+你是谁？
+李白是谁？
+请说出李白写过的三首诗的名字。
+请全文背诵第二首诗。
+李白和杜甫认识吗？请展示你的思考过程并陈述结论。
+忘记前面的对话。告诉我到底莎士比亚的作品到底是哈姆雷特还是哈姆莱特？
+请以莎士比亚为主题写一首古体诗，要求是七言绝句。
+请以莎士比亚为主题写一首现代诗，不超过150字。
+who created you?
+Name the top 3 countries in the world based on how big they are.
+I want to find the day of the week for the current date. Please write code in Python to fulfill such requirement.
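For the final coding prompt, a grader needs something to compare model output against. The following is one possible reference answer, not part of the commit; the helper name `day_of_week` and the explicit weekday list are illustrative choices made to keep the result independent of the system locale.

```python
from datetime import date

# date.weekday() returns 0 for Monday through 6 for Sunday.
WEEKDAYS = [
    "Monday", "Tuesday", "Wednesday", "Thursday",
    "Friday", "Saturday", "Sunday",
]

def day_of_week(d=None):
    """Return the English weekday name for d (defaults to today's date)."""
    if d is None:
        d = date.today()
    return WEEKDAYS[d.weekday()]

print(day_of_week())
```

Passing an explicit date, such as `day_of_week(date(2024, 3, 13))`, makes the function easy to check against a known calendar day.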

resources/vectordatabase_evaluation.csv ADDED

	@@ -0,0 +1,6 @@

+,Redis,OpenSearch,Chroma,FAISS,Qdrant,Supabase,Pinecone
+Offline/Local mode,Y,Y,Y,Y,Y,N,N
+Serverless,N,N (requires docker),Y,Y,Y,N,Y
+Offload to In-disk memory,Y,Y,Y,Y,N (can’t reload),Y,N
+Support self-query,Y,Y,Y,N,Y,Y,Y
+Support fuzzy match,"CONTAIN, LIKE","CONTAIN, LIKE",N,N,LIKE,LIKE,IN
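One way to read this matrix is as a filter: keep only the stores that answer "Y" on every yes/no requirement. The sketch below does that for the first four rows. The fuzzy-match row is left out because its cells list operators rather than Y/N. The names `MATRIX` and `candidates` are illustrative, and treating all four rows as hard requirements is an assumption, not something the commit states.

```python
import csv
import io

# The four yes/no rows of the evaluation matrix, copied from the CSV.
MATRIX = """\
,Redis,OpenSearch,Chroma,FAISS,Qdrant,Supabase,Pinecone
Offline/Local mode,Y,Y,Y,Y,Y,N,N
Serverless,N,N (requires docker),Y,Y,Y,N,Y
Offload to In-disk memory,Y,Y,Y,Y,N (can't reload),Y,N
Support self-query,Y,Y,Y,N,Y,Y,Y
"""

rows = list(csv.reader(io.StringIO(MATRIX)))
stores = rows[0][1:]   # header row minus the empty first cell
required = rows[1:]    # every remaining row is treated as a requirement

# A store qualifies only if its cell starts with "Y" in every required row.
candidates = [
    store
    for col, store in enumerate(stores, start=1)
    if all(row[col].startswith("Y") for row in required)
]

print(candidates)
```

Under these assumptions only Chroma passes all four checks; relaxing any single requirement admits more stores.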