Keynotes
More information will be posted here as it becomes available.
The Road to Immersive Multimedia Communications
John Apostolopoulos, HP Labs
8:40am - 9:40am, Tuesday, Jun 30
The vision of immersive communication is to enable natural and comfortable experiences and interactions between people in different locations. This vision has been seeded in science fiction and has been a goal for scientists and engineers for many years. Recent high-quality video conferencing systems, such as HP’s Halo system, are finally beginning to deliver on the decades old promise of a natural and effective video conferencing experience. Similar efforts are underway to transform, for example, the desktop and shared room settings. This is a result of advances in a number of intertwined research areas including video and audio signal processing, computer vision, graphics, networking, and very importantly, experience design which is aimed at understanding the key elements necessary for a natural and effective communication experience. This talk will examine current and future directions in immersive multimedia communication for human-to-human communication and collaboration. We will highlight some of the key advances behind the success of current systems, as well as open problems whose solutions may help create the next generation of immersive communication systems.
Presenter’s biography:John Apostolopoulos is the Director of the Multimedia Communications and Networking Lab (MCNL) and a Distinguished Technologist at HP Labs. The goals of MCNL are (1) to create compelling networked media experiences that fundamentally change how people communicate, collaborate, socialize and entertain, and (2) to create the intelligent infrastructure which provides predictable, high-quality and power-efficient networking to support current and future applications. He received his B.S., M.S., and Ph.D. degrees in EECS from MIT. In graduate school, he worked on the U.S. Digital TV standard and received an Emmy Award Certificate for his contributions. Recently, his work on transcoding in the middle of a network while preserving end-to-end security (secure transcoding) was adopted by the JPEG-2000 Security (JPSEC) standard. He was named “one of the world’s top 100 young (under 35) innovators in science and technology” (TR100) by MIT Technology Review in 2003, and an IEEE Fellow in 2008. He currently serves as chair of the IEEE Image, Video, and Multidimensional Signal Processing (IVMSP) technical committee, member of the Multimedia Signal Processing (MMSP) technical committee, and recently was general co-chair of VCIP'06, and technical co-chair for IEEE ICIP'07. He enjoys teaching and conducts joint research at Stanford University, where he is a Consulting Associate Professor of EE, and is a frequent visiting lecturer at MIT EECS. His primary research interests include improving the quality, reliability, scalability and security of multimedia communications over wired and wireless packet networks.
Image/Video Classification and Search: Addressing Semantic Gap and User Gap
Shih-Fu Chang, Columbia University
8:40am - 9:40am, Wednesday, July 1
Finding information from large image/video sources poses a challenging yet exciting problem. Two well-known barriers preventing successful solutions to date are the semantic gap and the user gap. The former refers to the difficulty in inferring high-level semantic labels from low-level pixel data. The latter reflects user’s frustration in expressing his/her information needs of visual content using existing systems. In this talk, I will present recent research trends aiming at solving both problems. First, I will survey various approaches (expert driven, linguistics based, and user driven) taken in defining large-scale multimedia concept ontologies, covering thousands of visual concepts from different categories (scene, object, event, people etc). I will then discuss the state of the art and open issues of automatic concept categorization, which if successful, promise to fill the semantic gap. I will emphasize issues of learning from a large pool of data with limited or imperfect labels (e.g., data from Internet). Second, to address the user gap, I will present efforts in developing intuitive interactive tools for video search. I will demonstrate a novel paradigm with unique emphasis on helping users to make sense of visual data throughout the search process, starting from initial query formulation to real-time navigation of the concept space and result sets simultaneously.
Presenter’s biography: Shih-Fu Chang is Professor and Chairman of Electrical Engineering and Director of Digital Video and Multimedia Lab at Columbia University. He received a BS degree in Electrical Engineering from National Taiwan University in 1985 and a PhD degree in EECS from U.C. Berkeley in 1993.
Chang has pioneered development of techniques and systems for content-based visual retrieval, video content analysis, compressed-domain video manipulation, and adaptive media streaming. In 1996, he and his students developed several of the first search engines for digital images and videos, such as VisualSEEk, VideoQ, and WebSEEk, using fully automated content-based analysis. Works from his group are influential in shaping the foundation of this active field. His other contributions include development of new visual search paradigms, large-scale object recognition models (e.g., Columbia374), international multimedia indexing standards (e.g., MPEG-7), and large multimedia ontologies (e.g., LSCOM). He has expanded his recent research into the emerging area of Internet Media Computing, emphasizing exploration of large amount of online media data, metadata, and use behaviors for expediting progresses in traditional fields like image search and computer vision.
Chang has received a number of awards, including IEEE Kiyo Tomiyasu Award (2009), Navy ONR Young Investigator Award, IBM Faculty Award, ACM Recognition of Service Award, NSF CAREER Award, and three Best Paper Awards from IEEE, ACM, and SPIE. Several students supervised by him have also received Best Student Paper Awards. He is a Fellow of IEEE and former Editor-in-Chief of the IEEE Signal Processing Magazine (2006-8). He has had fruitful interaction with industry, via directing Columbia’s ADVENT industry-university research consortium (2000-3), serving on technical advisory boards, and transferring more than ten technologies to companies.
Parallel Computing Revolution in Video Processing,
Wen-mei W. Hwu, University of Illinois at Urbana-Champaign
8:40am - 9:40am, Thursday, July 2
With the pervasive presence of low-cost, high quality cameras on cell phones and laptops, we are witnessing the beginning of an explosive growth of video applications. In the near future, these applications will allow us to better manage images and videos. In the long run, they will likely revolutionize our interactions with computers, game consoles, mobile devices and each other. Innovations in these applications will require orders of magnitude increase in computing speed than we have today. Recent advances in GPUs such as the NVIDIA GeForce GTX280 have transformed them into massively parallel, generally programmable many-core processors. Today, application developers for these many-core chips are reporting 10X-100X speedup over sequential code on traditional microprocessors. According to the semiconductor industry roadmap, these processors could scale up to over 1,000X speedup over single cores by the end of the year 2016. I will present several exciting efforts in using massively parallel processors to add innovations to the way we analyze, enhance, and synthesize videos for use in end-user applications. I will also address the interaction between algorithm design, programming and the parallel platform features to point to the areas that we will likely have new breakthroughs in the future.
Presenter’s biography: Wen-mei W. Hwu is a Professor and holds the Sanders-AMD Endowed Chair in the Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign. His research interests are in the area of architecture, implementation, and software for high performance computer systems. He is the director of the IMPACT research group. For his contributions in research and teaching, he received the ACM SigArch Maurice Wilkes Award, the ACM Grace Murray Hopper Award, the Tau Beta Pi Daniel C. Drucker Eminent Faculty Award, and ISCA Most Influential Paper Award. He is a fellow of IEEE and ACM. Hwu serves on the Executive Committee of the MARCO/DARPA C2S2 and GSRC Focus Research Centers. He leads the GSRC Concurrent Systems Theme and directs the CUDA Center of Excellence at UIUC. He also co-directs the new $18M UIUC Intel/Microsoft Universal Parallel Computing Research Center with Marc Snir and serves as one of the principal investigators of the $208M NSF Blue Waters Petascale computer project. Dr. Hwu received his Ph.D. degree in Computer Science from the University of California, Berkeley.
