I am an AI Researcher working on conversational AI and OpenGPT-X at Fraunhofer IAIS in Germany. My major research interest centers around investigating and comprehending the reasoning and planning aptitudes displayed by the new generation of foundational language models. Additionally, I explore diverse novel applications and evaluation systems leveraging large language models.
I successfully obtained my Master’s degree from the Department of Computer Science, Saarland University in Germany. Before that, I served as a research student at the Max Planck Institute for Informatics, focusing on ad-hoc neural information retrieval systems and natural language processing. Additionally, I had the opportunity to work as a visiting student researcher at the Laboratory for Computational Social Systems, IIIT Delhi.
I completed my Master’s thesis titled “Retrieval augmented generative task-oriented dialogue systems for faithful knowledge grounding” with the Chair of Computer Science and Computational Linguistics and Fraunhofer IAIS. I was fortunate to be advised by Prof. Dr. Vera Demberg and Prof. Dr. Jens Lehmann. At my previous research position (Immersion Lab), I worked with the Database and Information Systems Group where my work was supervised by Dr. Erisa Terroli. During my Masters, I have also worked as a research assistant at German Research Center for AI, Saarbrücken, and at Fraunhofer IZFP, Saarbrücken.
My interests are in building robust machine learning models for scalable and reliable AI systems, spanning the areas of natural language processing, information retrieval, knowledge bases and knowledge graphs-based reasoning, and graph machine learning. Currently, I am involved in supporting the training of foundational LLMs like OpenGPTx and TrustLLM, with a particular focus on investigating their trustworthiness and exploring dimensions of reasoning, planning, and calibration/alignment.
I am currently looking for potential fully-funded PhD positions to work on challenging and interesting problems in my topic of interest. I am also open to R&D full-time and research residency positions where I can utilize my creative problem-solving and programming skills.
MSc, Department of Computer Science, 2022
Saarland University, Saarbrücken, Germany
BTech, Electronics and Telecommunication Engineering, 2016
IIIT Bhubaneswar, India
5+ years experience in coding with Python and ML libraries
Experience through relevant coursework, projects and research work
Experience through industry projects, related course and research work
Experience through industry projects, related course and research work
Experience in writing big data algorithms and visualizations
Experience in working on industrial and academic projects
Graduate student in the department of computer science with a strong focus on machine learning and information systems.
Undergraduate student in the department of electronics engineering with a focus towards signal processing.
Studying social networks for exploring structural and behavioural properties using graph mining and NLP.
Research student working in the topics of neural information retrieval systems, with a focus on health domain.
Worked at the department of Algorithm, Signal, and Data Processing.
Worked as a part of the project on MMPE: A Multi-Modal Interface for Post-Editing Machine Translation.
Worked as part of the Data Science team, with a focus on Information Systems.
Worked as part of the DataOps Analytics team and Cloud Infrastructure R&D team.
Here we try to analyze different approaches of online tweet classification into offensive and non-offensive categories in the presence of emoticons. We try to enhance the tokenization process to capture better sparsity in the presence of non-textual corpus.
This is an unsupervised learning based lustering web tool that has been used to cluster microstructures in Steel based on their physical properties that have been extracted using classical image processing algorithms. Files can then be uploaded in .xlsx or .csv format. The tool supports five different clustering algorithms in combination with dimensionality reduction.
In this work, we propose using U-net and ACGAN as a learning framework for feature generation of medical images followed by classification to validate the quality of generated features.
Since health forums become a rich source of information to people with medical conditions discussing treatments, doctor’s opinions, side-effects to complex-drugs, while also sharing personal background medical information in a community question-answering framework, we develop a neural search engine on top of such health forums by exploring the state-of-the-art neural ranking models. We first write a set of optimal heuristic functions that maximizes the relevancy scores for a labelled dataset by training a snorkel classifier that classifies a given query-document pair as relevant or irrelevant. Later, these functions are extended to classify the unlabelled set of query-document pairs, followed by re-ranking using neural re-rankers.
We implement k-nearest neighbors, Gaussian Mixture Model, Multi-class SVM, Convolutional Neural Network, and Convolutional Recurrent Neural Network to classify the following four genres- Dark-Forest, Hi-Tech, Full-On, and Goa. We further extract 30 temporal features using a Long Short Term Memory based Auto encoder from individual frames, and augment them with the frame-level audio features, which is a novel contribution in this work.
Study on Robustness of Automated Essay Scoring Systems to Out-of-domain and Adversarial Inputs summary
Attended the Max Planck Intersectional Symposium on Computing and Society (January 2021).
Attended the 2020 Neural Information Processing Systems (NeurIPS ) (December 2020).
Phonologically, my name is pronounced as ‘Soumy’ Ranjan Sahoo | ‘सौम्य’ रंजन साहू | ‘ସୌମ୍ୟ’ ରଂଜନ ସାହୁ