By Jian Wu and Tiffany Whitfield

Jian Wu, Ph.D., assistant professor of Computer Science at Old Dominion University, is at the forefront of innovation and big data. He recently presented his work at one of the world's leading international academic conferences on artificial intelligence. On Feb. 22, 2024, Wu presented a paper titled “ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations” at the 36th Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-24), collocated with the 38th Annual AAAI Conference on Artificial Intelligence (AAAI) in Vancouver, Canada.

This year, more than 5,000 people from many continents and countries attended the conference. Whereas AAAI focuses more on theoretical contributions, IAAI focuses on applying AI to real-world scenarios. The IAAI acceptance rate this year was 24%, making it one of the most competitive conferences on AI applications.

The first author of the paper Dr. Wu presented is Muntabir Choudhury, a senior Ph.D. student in Computer Science at ODU. The paper proposes a novel method to classify PDF pages of electronic theses and dissertations (ETDs) into 13 different categories, such as chapters, reference lists, appendices, and title pages. The new method, called the multimodality model, leverages a deep neural network to fuse text and visual information into a single representation. It achieved much higher performance than state-of-the-art methods, which relied on either text or visual information alone, improving accuracy by at least 25%. This work lays a foundation for building a user-friendly online reader for ETDs. Instead of downloading and reading a lengthy ETD on a computer, users can navigate directly to the sections they are interested in. Dr. Wu said, “Muntabir is an excellent student, and I am glad his two years’ effort finally paid off.”

This work was partially sponsored by a research grant awarded by the Institute of Museum and Library Services. In addition to Choudhury and Wu, other participants included Lamia Salsabil, a graduate student at ODU; Edward Fox, Ph.D., professor of Computer Science at Virginia Tech; and Bill Ingram, Ph.D., assistant professor and associate dean and executive director for information technologies in the University Libraries at Virginia Tech.

Several current and former ODU professors attended the conference, including Dr. Jiang Li (ECE, ODU), Dr. Hongyi Wu (University of Arizona), and Dr. Wu He (National Science Foundation).