Başlangıç ve Bitiş Tarihi

01 Kasım 2023-31 Ekim 2028

Koordinatör

Boğaziçi Üniversitesi

Bütçe

1.982.800 €

Desteklendiği Program ve Alan
Avrupa Araştırma Konseyi
Desteklendiği Çerçeve Program

Ufuk Avrupa

Projenin CORDIS Linki
LifeLU
Arzucan Özgür ERC

Proteinler, yaşamı sürdüren biyolojik süreçlerde önemli bir rol oynar. Üç boyutlu moleküller olmalarına rağmen, amino asit dizileri olarak metinsel olarak da temsil edilebilmektedirler. Protein dizileri, yaşamın dili olarak da ifade edilen belirli bir dille yazılmış metinler olarak düşünülebilir. Doç. Dr. Arzucan ÖZGÜR ve ekibi LifeLU Projesi kapsamında insan dillerindeki kelimelere benzer şekilde, proteinlerin en küçük anlamlı birimlerini ve bu birimlerin anlamlarını (işlevlerini) tespit etmek için yenilikçi yöntemler geliştirmeyi ve yaşamın dilini daha iyi anlayabilmek için bir temel oluşturmayı hedeflemektedir. 

ÖZGÜR'ün proje önerisi TÜBİTAK ERC Baş Araştırmacı Geliştirme Programı (EBAG) kapsamında desteklenmiştir. 


Çağrı: ERC-2022-CoG

Proje Süresi: 5 yıl

Proje Akonimi: LifeLU

Proje Başlığı: Understanding the Language of Life: Identifying and Characterizing the Language Units in Protein Sequences

Proje Numarası: 101089287

Ev Sahibi Kurum: Boğaziçi Üniversitesi

Panel: PE6 - Computer Science and Informatics

İlgili ERC Proof of Concept Projesi: -


Understanding the sequences of proteins

Proteins, crucial for life, can be depicted textually as sequences of amino acids. Although the ‘language of life’ remains incompletely understood, breakthroughs in studying proteins have been made possible through natural language processing. The ERC-funded LifeLU project is dedicated to advancing language processing research and delving into new frontiers in comprehending the language of life. The project will pioneer innovative methods to identify the language units within the language of life and explore the characteristics and variability among different species. Additionally, it aims to create novel techniques for identifying and analysing the functions of these language units. LifeLU endeavours to decode the language of life, with the ultimate goal of developing innovative approaches for disease prevention, diagnosis and treatment.

Objective

"Proteins play a key role in biological processes that govern and maintain life. Although they are three-dimensional entities, they can be represented in textual form as sequences of amino acids that largely determine their structures and functions. By analogy with natural (human) languages, we can consider proteins as written with a language, which we refer to in this proposal as the ""language of life"". Natural languages can be read and understood by humans. However, we cannot yet understand the language of life. We do not even know what the vocabulary is, i.e. what the basic language units are (analogous to words in human languages). Textual representation of proteins has enabled the application of natural language processing (NLP) techniques to the study of proteins, and breakthrough results have been achieved in various downstream tasks such as protein structure prediction. However, these efforts remain only at the ""processing level"" of the language of life. The main goal of this project is to go beyond the level of language processing and open new research horizons for understanding the language of life. Using my expertise in NLP and bioinformatics, I will pursue the following objectives: (i) develop innovative methods to determine the language units (i.e. the vocabulary) of the language of life; (ii) identify the characteristics of this language as well as its variability among species; (iii) develop novel methods to identify and characterize the functions of the language units. This research will lay the foundation for a new field of research, molecular language understanding, which aims to develop methods for understanding the messages encoded in molecular sequences. The ultimate goal of this project is to decipher the language of life, which will lead to groundbreaking consequences for understanding life and health, and will shed light to the development of novel prevention, diagnosis, and treatment strategies for diseases."

(Kaynak: CORDIS)

BAŞARI HİKAYELERİ