
Negar Arabzadeh

Postdoctoral Fellow at UC Berkeley

This is Negar Arabzadeh. I am a postdoctoral researcher at UC Berkeley, working with Professor Matei Zaharia. I completed my PhD at the University of Waterloo, where I was advised by Dr. Charles Clarke. My research lies at the intersection of information retrieval and the evaluation of—and with—large language models (LLMs) in information access systems.

August 2025 

🌟 First project release as a postdoc at Sky Lab, a Stanford + Berkeley collaboration: DeepScholar-Bench, a live benchmark for generative research synthesis.

🪑 I will be serving as Workshop Chair for ECIR 2026.

📚 Three papers were accepted at CIKM 2025:

  • RottenReviews: Benchmarking Review Quality with Human and LLM-Based Judgments

    • w/ Sajad Ebrahimi, Soroush Sadeghian, Ali Ghorbanpour, Sara Salamat, Muhan Li, Hai Son Le, Mahdi Bashari and Ebrahim Bagheri

  • Building Trustworthy Peer Review Quality Assessment Systems

    • w/ Sajad Ebrahimi, Ali Ghorbanpour, Soroush Sadeghian, Sara Salamat, Muhan Li, Hai Son Le, Mahdi Bashari and Ebrahim Bagheri

  • LLM-as-a-Judge in Entity Retrieval: Assessing Explicit and Implicit Relevance

    • w/ Mohammad Hossein Saliminabi, Seyed Mohammad Hosseini, Dimitrios Androutsos, Morteza Zihayat and Ebrahim Bagheri

July 2025

✈️ I attended SIGIR in Padua, Italy.

🏅 I won a best reviewer award at SIGIR.

June 2025 

🪑 I will be serving as Publicity Chair for CHIIR 2026.

🏆 I was honored to receive a SIGIR student travel grant.

May 2025

🎓 I have successfully defended my PhD! 

April 2025

🌟 I moved to California and officially started my postdoc at UC Berkeley. I can’t believe this dream is now reality!

📚 Four papers were accepted at SIGIR:

  • Benchmarking LLM-based Relevance Judgment Methods

    • w/ Charles Clarke

  • VAP3: Variation-Aware Prompt Performance Prediction

    • w/ Ebrahim Bagheri

  • A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment

    • w/ Charles Clarke

  • IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents

    • w/ Shrestha Mohanty, Andrea Tupini, Yuxuan Sun, Alexey Skrynnik, Artem Zholus, Marc-Alexandre Cote and Julia Kiseleva

 

December 2024

✈️ I attended SIGIR-AP in Tokyo, Japan.

  • We presented two papers and a tutorial.

✈️ I attended ACML in Hanoi, Vietnam.

  • We presented two papers.

📚 Two papers were accepted at ECIR:

  • exHarmony: Authorship and Citations for Benchmarking the Reviewer Assignment Problem

    • w/ Sajad Ebrahimi, Sara Salamat, Mahdi Bashari and Ebrahim Bagheri

  • Benchmarking Prompt Sensitivity in Large Language Models

    • w/ Amirhosein Razavi, Mina Soltangheis, Sara Salamat, Morteza Zihayat and Ebrahim Bagheri
