Computational Approaches to Arabic Script-based Languages

[    Home |   Contact   ]       



   Arabic Script Languages

   Workshop Proceedings

   Organizing Committee

   Tools

   Mailing List



Tools and Resources Presented at CAASL Workshops



Computational Tools

  • Urdu
      - Urdu Morphology, includes Urdu resources such as source code, thesis report, technical manuals and an online analyzer tool for users.
      (courtesy of Muhammad Humayoun)
  • Arabic
  • Persian
      - Hamshahri corpus, consisting of 345 MB of news texts from the Hamshahri newspaper, developed for information retrieval reserach.
      - Bijankhan corpus, a manually tagged corpus of about 2.6 million words gathered from daily news and commone texts, suitable for natural language processing research.
      (courtesy of Hadi Amiri)

    Educational Tools

  • Persian unicode editor
    (courtesy of Behdad Esfahbod)
  • Persian Verb conjugator
    (created by Artem Lukanin, Connie Bobroff, and Ali Jahanshiri)