QizhiPei

QizhiPei

AI & ML interests

AI4Science, Natural Language Processing

Recent Activity

liked a model about 2 hours ago
Qwen/Qwen2.5-7B-Instruct
liked a dataset about 22 hours ago
nampdn-ai/tiny-codes
liked a model 12 days ago
nomic-ai/nomic-embed-text-v2-moe
View all activity

Organizations

None yet

QizhiPei's activity

reacted to RishabhBhardwaj's post with 👍 7 months ago
view post
Post
2470
🎉 We are thrilled to share our work on model merging. We proposed a new approach, Della-merging, which combines expert models from various domains into a single, versatile model. Della employs a magnitude-based sampling approach to eliminate redundant delta parameters, reducing interference when merging homologous models (those fine-tuned from the same backbone).

Della outperforms existing homologous model merging techniques such as DARE and TIES. Across three expert models (LM, Math, Code) and their corresponding benchmark datasets (AlpacaEval, GSM8K, MBPP), Della achieves an improvement of 3.6 points over TIES and 1.2 points over DARE.

Paper: DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling (2406.11617)
Github: https://github.com/declare-lab/della

@soujanyaporia @Tej3
  • 3 replies
·
reacted to qgao007's post with 🤗 about 1 year ago
view post
Post
Hello world!

  • 1 reply
·
reacted to their post with 👍 about 1 year ago
view post
Post
🚀⭐️Introducing our new survey "Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey"

arxiv: https://arxiv.org/abs/2403.01528
github: https://github.com/QizhiPei/Awesome-Biomolecule-Language-Cross-Modeling

The integration of biomolecular modeling with natural language (BL) has emerged as a promising interdisciplinary area at the intersection of artificial intelligence, chemistry and biology. This approach leverages the rich, multifaceted descriptions of biomolecules contained within textual data sources to enhance our fundamental understanding and enable downstream computational tasks such as biomolecule property prediction. The fusion of the nuanced narratives expressed through natural language with the structural and functional specifics of biomolecules described via various molecular modeling techniques opens new avenues for comprehensively representing and analyzing biomolecules. By incorporating the contextual language data that surrounds biomolecules into their modeling, BL aims to capture a holistic view encompassing both the symbolic qualities conveyed through language as well as quantitative structural characteristics. In this review, we provide an extensive analysis of recent advancements achieved through cross modeling of biomolecules and natural language.


posted an update about 1 year ago
view post
Post
🚀⭐️Introducing our new survey "Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey"

arxiv: https://arxiv.org/abs/2403.01528
github: https://github.com/QizhiPei/Awesome-Biomolecule-Language-Cross-Modeling

The integration of biomolecular modeling with natural language (BL) has emerged as a promising interdisciplinary area at the intersection of artificial intelligence, chemistry and biology. This approach leverages the rich, multifaceted descriptions of biomolecules contained within textual data sources to enhance our fundamental understanding and enable downstream computational tasks such as biomolecule property prediction. The fusion of the nuanced narratives expressed through natural language with the structural and functional specifics of biomolecules described via various molecular modeling techniques opens new avenues for comprehensively representing and analyzing biomolecules. By incorporating the contextual language data that surrounds biomolecules into their modeling, BL aims to capture a holistic view encompassing both the symbolic qualities conveyed through language as well as quantitative structural characteristics. In this review, we provide an extensive analysis of recent advancements achieved through cross modeling of biomolecules and natural language.


reacted to their post with ❤️ about 1 year ago
view post
Post
BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations

BioT5 achieves superior performance on various biological tasks by integrating natural language. See more details at:

Paper: BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations (2310.07276)
Code: https://github.com/QizhiPei/BioT5
Model (base): QizhiPei/biot5-base
Model (molecule captioning): QizhiPei/biot5-base-mol2text
Model (Text-based Molecule Design): QizhiPei/biot5-base-text2mol
  • 1 reply
·
posted an update about 1 year ago
view post
Post
BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations

BioT5 achieves superior performance on various biological tasks by integrating natural language. See more details at:

Paper: BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations (2310.07276)
Code: https://github.com/QizhiPei/BioT5
Model (base): QizhiPei/biot5-base
Model (molecule captioning): QizhiPei/biot5-base-mol2text
Model (Text-based Molecule Design): QizhiPei/biot5-base-text2mol
  • 1 reply
·