• ARABIC QUESTION ANSWERING ON THE HOLY QUR'AN 

      MALHAS, RANA R. (2023 , Dissertation)
      In this dissertation,we address the need for an intelligent machine reading at scale (MRS) Question Answering (QA) system on the Holy Qur'an, given the permanent interest of inquisitors and knowledge seekers in this sacred ...
    • ArabicWeb16: A new crawl for today's Arabic Web 

      Suwaileh, Reem; Kutlu, Mucahid; Fathima, Nihal; Elsayed, Tamer; Lease, Matthew ( Association for Computing Machinery, Inc , 2016 , Conference Paper)
      Web crawls provide valuable snapshots of the Web which enable a wide variety of research, be it distributional analysis to characterize Web properties or use of language, content analysis in social science, or Information ...
    • ArCov-19: The First Arabic COVID-19 Twitter Database with Propagation Networks 

      Haouari, Fatima; Hasanain, Maram; Suwaileh, Reem; Elsayed, Tamer ( Cornel University , 2020 , Article  &   Video)
      In this paper, we present ArCOV-19, an Arabic COVID-19 Twitter dataset that covers the period from 27th of January till 31st of March 2020. ArCOV-19 is the first publicly-available Arabic Twitter dataset covering COVID-19 ...
    • BIGIR at CLEF 2019: Automatic verification of Arabic claims over the web 

      Haouari, Fatima; Ali, Zien Sheikh; Elsayed, Tamer ( CEUR-WS , 2019 , Conference Paper)
      With the proliferation of fake news and its prevalent impact on democracy, journalism, and public opinions, manual fact-checkers become unscalable to the volume and speed of fake news propagation. Automatic fact-checkers ...
    • Building a Test Collection for Significant-Event Detection in Arabic Tweets 

      Almerekhi, Hind Ali (2016 , Master Thesis)
      With the increasing popularity of microblogging services like Twitter, researchers discov- ered a rich medium for tackling real-life problems like event detection. However, event detection in Twitter is often obstructed ...
    • CloudFlow: A data-aware programming model for cloud workflow applications on modern HPC systems 

      Zhang, Fan; Malluhi, Qutaibah M.; Elsayed, Tamer; Khan, Samee U.; Li, Keqin; ... more authors ( Elsevier , 2015 , Article)
      Traditional High-Performance Computing (HPC) based big-data applications are usually constrained by having to move large amount of data to compute facilities for real-time processing purpose. Modern HPC systems, represented ...
    • Crowd vs. Expert: What can relevance judgment rationales teach us about assessor disagreement? 

      Kutlu, M.; Kutlu, Mucahid; McDonnell, Tyler; Barkallah, Yassmine; Elsayed, Tamer; ... more authors ( ACM , 2018 , Conference Paper)
      © 2018 ACM. While crowdsourcing offers a low-cost, scalable way to collect relevance judgments, lack of transparency with remote crowd work has limited understanding about the quality of collected judgments. In prior work, ...
    • DART: A large dataset of dialectal Arabic tweets 

      Alsarsour, Israa; Mohamed, Esraa; Suwaileh, Reem; Elsayed, Tamer ( European Language Resources Association (ELRA) , 2019 , Conference Paper)
      In this paper, we present a new large manually-annotated multi-dialect dataset of Arabic tweets that is publicly available. The Dialectal ARabic Tweets (DART) dataset has about 25K tweets that are annotated via crowdsourcing ...
    • DID I SEE IT BEFORE? RETRIEVING PREVIOUSLY CHECKED CLAIMS OVER TWITTER 

      MANSOUR,WATHEQ AHMAD (2022 , Master Thesis)
      With the proliferation of fake news in the last few years, especially during COVID- 19, combating the spread of misinformation has become a social and political urgent need. Fact-checkers and journalists need to identify ...
    • ENABLING EFFECTIVE ARABIC INFORMATION RETRIEVAL ON THE WEB AND SOCIAL MEDIA 

      HASANAIN, MARAM GHANEM (06-2 , Dissertation)
      Arabic is one of the most dominant languages on the Web and social media. The huge and ever-growing Arabic user generated content, further motivated by the ongoing political unrest in the region, created an immense need ...
    • EveTAR: A new test collection for event detection in Arabic tweets 

      Almerekhi, Hind; Hasanain, Maram; Elsayed, Tamer ( Association for Computing Machinery, Inc , 2016 , Conference Paper)
      Research on event detection in Twitter is often obstructed by the lack of publicly-available evaluation mechanisms such as test collections; this problem is more severe when considering the scarcity of them in languages ...
    • LOCATION MENTION PREDICTION FROM DISASTER TWEETS 

      SUWAILEH, REEM ALI (2023 , Dissertation)
      While utilizing Twitter data for crisis management is of interest to different response authorities, a critical challenge that hinders the utilization of such data is the scarcity of automated tools that extract and resolve ...
    • ON RELEVANCE FILTERING FOR REAL-TIME TWEET SUMMARIZATION 

      SUWAILEH, REEM ALI (2018 , Master Thesis)
      Real-time tweet summarization systems (RTS) require mechanisms for capturing relevant tweets, identifying novel tweets, and capturing timely tweets. In this thesis, we tackle the RTS problem with a main focus on the relevance ...
    • On the evaluation of tweet timeline generation task 

      Magdy, Walid; Elsayed, Tamer; Hasanain, Maram ( Springer Verlag , 2016 , Conference Paper)
      Tweet Timeline Generation (TTG) task aims to generate a timeline of relevant but novel tweets that summarizes the development of a given topic. A typical TTG system first retrieves tweets then detects novel tweets among ...
    • Overview of the CLEF-2019 Checkthat! LAB: Automatic identification and verification of claims. Task 2: Evidence and factuality 

      Hasanain, Maram; Suwaileh, Reem; Elsayed, Tamer; Barrón-Cedeño, Alberto; Nakov, Preslav ( CEUR-WS , 2019 , Conference Paper)
      We present an overview of the second edition of the CheckThat! Lab at CLEF 2019. The lab featured two tasks in two different languages: English and Arabic. Task 1 (English) challenged the participating systems to predict ...
    • QU-IR at SemEval 2016 Task 3: Learning to rank on Arabic community question answering forums with word embedding 

      Malhas, Rana; Torki, Marwan; Elsayed, Tamer ( Association for Computational Linguistics (ACL) , 2016 , Conference Paper)
      Resorting to community question answering (CQA) websites for finding answers has gained momentum in the past decade with the explosive rate at which social media has been proliferating. With many questions left unanswered ...
    • Query performance prediction for microblog search 

      Hasanain, Maram; Elsayed, Tamer ( Elsevier Ltd , 2017 , Article)
      Query performance prediction (QPP) is the task of estimating the effectiveness of a retrieval system given a search query in the absence of any feedback from the searcher. The task has been proven to be very challenging, ...
    • Real-time Tweet Summarization Mobile Application 

      Salim, Nazar S. (2018 , Professional Masters Project)
      With the emergence of the massive volume of content through social media platforms, users are getting overwhelmed with information, though searching for the topic will give you filtered information that interests you. ...
    • SparkIR: a Scalable Distributed Information Retrieval Engine over Spark 

      Al-Rasbi, Sara Yaqoob (2020 , Master Thesis)
      Search engines have to deal with a huge amount of data (e.g., billions of documents in the case of the Web) and find scalable and efficient ways to produce effective search results. In this thesis, we propose to use ...
    • Unsupervised adaptive microblog filtering for broad dynamic topics 

      Magdy, Walid; Elsayed, Tamer ( Elsevier , 2016 , Article)
      Information filtering has been a major task of study in the field of information retrieval (IR) for a long time, focusing on filtering well-formed documents such as news articles. Recently, more interest was directed towards ...