Sahar Nasirihaghighi, Negin Ghamsarian, Raphael Sznitman, and Klaus Schoeffmann. 2025. Dual Invariance Self-Training for Reliable Semi-Supervised Surgical Phase Recognition. In Proceedings of the 2025 IEEE 22nd International Symposium on Biomedical Imaging (ISBI), IEEE, 5 pages.
Sahar Nasirihaghighi, Negin Ghamsarian, Heinrich Husslein, and Klaus Schoeffmann. 2024. Event Recognition in Laparoscopic Gynecology Videos with Hybrid Transformers. In Proceedings of the 30th International Conference on Multimedia Modeling (MMM 2024). Lecture Notes in Computer Science (LNCS), vol tbd, Springer, Cham. 14 pages, to appear
Vignesh V Menon, Reza Farahani, Prajit Rajendran, Samira Afzal, Klaus Schoeffmann, and Christian Timmerer. 2023. Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Streaming. In Proceedings of the IEEE International Conference on Visual Communications and Image Processing (VCIP 2023), IEEE, Los Alamitos, CA, USA, 1-5. DOI: 10.1109/VCIP59821.2023.10402699
Vignesh V. Menon, Prajit T. Rajendran, Reza Farahani, Klaus Schoeffmann, and Christian Timmerer. 2024. Video Quality Assessment with Texture Information Fusion for Streaming Applications. In Proceedings of the 3rd Mile-High Video Conference (MHV ’24). Association for Computing Machinery, New York, NY, USA, 1–6.
Vignesh V. Menon, Jingwen Zhu, Prajit T. Rajendran, Samira Afzal, Klaus Schoeffmann, Patrick Le Callet, and Christian Timmerer. 2024. Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low latency Encoding. In Proceedings of the 3rd Mile-High Video Conference (MHV ’24). Association for Computing Machinery, New York, NY, USA, 61–67.
Markus Wieser, Klaus Schöffmann, Daniela Stefanics, Andreas Bollin, and Stefan Pasterk. 2023. Investigating the Role of ChatGPT in Supporting Text-Based Programming Education for Students and Teachers. In Proceedings of the 16th International Conference on Informatics in Schools (ISSEP 2023). Springer LNCS, Heidelberg. 40-53. doi: 10.1007/978-3-031-44900-0_4
Negin Ghamsarian, Javier Gamazo Tejero, Pablo Márquez Neila, Sebastian Wolf, Martin Zinkernagel, Klaus Schoeffmann, Raphael Sznitman. 2023. Domain Adaptation for Medical Image Segmentation using Transformation-Invariant Self-Training. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention 2023 (MICCAI 2023), Springer LNCS, Heidelberg, 331-341, doi: 10.1007/978-3-031-43907-0_32
Sahar Nasirihaghighi, Negin Ghamsarian, Daniela Stefanics, Klaus Schoeffmann and Heinrich Husslein. 2023. Action Recognition in Video Recordings from Gynecologic Laparoscopy. In Proceedings of the IEEE 36th International Symposium on Computer-Based Medical Systems (CBMS) 2023, IEEE, Los Alamitos, CA, 29-34, doi: 10.1109/CBMS58004.2023.00187
Vignesh V Menon, Prajit T Rajendran, Christian Feldmann, Martin Smole, Klaus Schoeffmann, Mohammad Ghanbari, and Christian Timmerer. 2023. Perceptually-aware Live VBR Encoding Scheme for Adaptive AVC Streaming. 2023 NAB Broadcast Engineering and Information Technology (BEIT) Conference. Las Vegas, USA.
Natalia Mathá, Klaus Schoeffmann, Stephanie Sarny, Doris Putzgruber-Adamitsch and Yosuf El-Shabrawi. 2022. Evaluation of Relevance-Driven Compression of Regular Cataract Surgery Videos. In Proceedings of the IEEE 35th International Symposium on Computer Based Medical Systems (CBMS), July 21-23, Shenzhen, China. 429-434. doi: 10.1109/CBMS55023.2022.00083
Negin Ghamsarian, Mario Taschwer, Raphael Sznitman and Klaus Schoeffmann. 2022. DeepPyram: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos. In Proceedings of the 25th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2022). Lecture Notes in Computer Science, Vol 13435, Springer International Publishing, September 18-22, 2022, Singapore, 276-286. doi: 10.1007/978-3-031-16443-9_27
Markus Fox and Klaus Schoeffmann. 2022. The Impact of Dataset Splits on Classification Performance in Medical Videos. In Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR) 2022, ACM, New York, NY, USA, 6-10. doi: 10.1145/3512527.3531424.
Jakub Lokoc, Luca Rossetto, Werner Bailer, Klaus Schoeffmann, Stefanos Vrochidis, Cathal Gurrin, Silvan Heller, Lucia Vadicamo, Kai Uwe Barthel, Ladislav Peska, Jiaxin Wu, and Björn Thor Jonsson. 2022. A Task Category Space for User-Centric Comparative Multimedia Search Evaluations. In Proceedings of the 28th International Conference on Multimedia Modeling (MMM 2022). Lecture Notes in Computer Science (LNCS), vol 13141, Springer, Cham. 193-204. doi: 10.1007/978-3-030-98358-1_16.
Negin Ghamsarian, Mario Taschwer, Doris Putzgruber-Adamitsch, Stephanie Sarny, Yosuf El-Shabrawi and Klaus Schoeffmann. 2021. ReCal-Net: Joint Region-Channel-Wise Calibrated Network for Semantic Segmentation in Cataract Surgery Videos. In Proceedings of the 28th International Conference on Neural Information Processing 2021 (ICONIP). 391-402. doi: 10.1007/978-3-030-92238-2_33
Negin Ghamsarian, Mario Taschwer, Doris Putzgruber-Adamitsch, Stephanie Sarny, Yosuf El-Shabrawi and Klaus Schoeffmann. 2021. LensID: A CNN-RNN-Based Framework Towards Lens Irregularity Detection in Cataract Surgery Videos. In Proceedings of the 24th International Conference on Medical Image Computing & Computer Assisted Intervention 2021 (MICCAI 2021). 76-86. doi: 10.1007/978-3-030-87237-3_8.
P. Steinkellner and K. Schöffmann. 2021. Evaluation of Object Detection Systems and Video Tracking in Skiing Videos. Proceedings of the 2021 International Conference on Content-Based Multimedia Indexing (CBMI 2021), pp. 1-6, doi: 10.1109/CBMI50038.2021.9461905.
Negin Ghamsarian, Mario Taschwer, Doris Putzgruber, Stephanie Sarny, and Klaus Schoeffmann. 2020. Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization. Proceedings of the 25th International Conference on Pattern Recognition (ICPR 2020). IEEE, Los Alamitos, CA, USA, pp. 10720-10727, doi: 10.1109/ICPR48806.2021.9412525.
Negin Ghamsarian, Hadi Amirpour, Christian Timmerer, Mario Taschwer, and Klaus Schoeffmann. 2020. Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Neural Networks. In Proceedings of the ACM International Conference on Multimedia (ACMMM) 2020, ACM, New York, NY, USA, 3577-3585. doi: 10.1145/3394171.3413658
Markus Fox, Mario Taschwer, Klaus Schoeffmann. 2020. Pixel-Based Tool Segmentation in Cataract Surgery Videos with Mask R-CNN. Proceedings of the IEEE 33rd International Symposium on Computer Based Medical Systems (CBMS 2020). IEEE, Los Alamitos, CA, USA, 568-571. doi: 10.1109/CBMS49503.2020.00112.
Negin Ghamsarian, Mario Taschwer, Klaus Schoeffmann. 2020. Deblurring Cataract Surgery Videos Using a Multi-Scale Deconvolutional Neural Network. Proceedings of the IEEE International Symposium on Biomedical Imaging 2020 (ISBI2020). IEEE, Los Alamitos, CA, USA, 872-876. doi: 10.1109/ISBI45749.2020.9098318.
Sabrina Kletz, Klaus Schoeffmann, Andreas Leibetseder, Jenny Benois-Pineau, Heinrich Husslein. 2020. Instrument Recognition in Laparoscopy for Technical Skill Assessment. Proceedings of the 26th International Conference on Multimedia Modeling 2020 (MMM2020), Lecture Notes in Computer Science, Vol 11962, Springer International Publishing, Cham, 589-600.
Andreas Leibetseder, Sabrina Kletz, Klaus Schoeffmann, Simon Keckstein, and Jörg Keckstein. 2020. GLENDA: Gynecologic Laparoscopy Endometriosis Dataset. Proceedings of the 26th International Conference on Multimedia Modeling 2020 (MMM2020). Lecture Notes in Computer Science, Vol 11962, Springer International Publishing, Cham, 439-450.
Natalia Sokolova, Klaus Schoeffmann, Mario Taschwer, Doris Putzgruber-Adamitsch, and Yosuf El-Shabrawi. 2020. Evaluating the Generalization Performance of Instrument Classification in Cataract Surgery Videos. Proceedings of the 26th International Conference on Multimedia Modeling 2020 (MMM2020). Lecture Notes in Computer Science, Vol 11962, Springer International Publishing, Cham, 626-636.
Sabrina Kletz, Klaus Schoeffmann, Jenny Benois-Pineau, and Heinrich Husslein. 2019. Identifying Surgical Instruments in Laparoscopy Using Deep Learning Instance Segmentation. Proceedings of the International Conference on Content-Based Multimedia Indexing (CBMI 2019). IEEE, Los Alamitos, CA, USA, 6 pages
Cheng Peng, Qing Xu, Yuejun Guo, and Klaus Schoeffmann. 2019. Eye Movement-based Analysis on Methodologies and Efficiency in the Process of Image Noise Evaluation. Proceedings of the 28th International Conference on Artificial Neural Networks, ICANN2019, Springer LNCS, Vol. 11729, 29-40.
Fabian Berns, Luca Rossetto, Klaus Schoeffmann, Christian Beecks, and George Awad. 2019. V3C1 Dataset: An Evaluation of Content Characteristics. In Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR ’19). ACM, New York, NY, USA, 334-338.
Sabrina Kletz, Andreas Leibetseder, and Klaus Schoeffmann. 2019. A comparative study of video annotation tools for scene understanding: yet (not) another annotation tool. In Proceedings of the 10th ACM Multimedia Systems Conference (MMSys ’19). ACM, New York, NY, USA, 133-144.
Cathal Gurrin, Klaus Schoeffmann, Hideo Joho, Bernd Münzer, Rami Albatal, Frank Hopfgartner, Liting Zhou, and Duc-Tien Dang-Nguyen. 2019. A Test Collection for Interactive Lifelog Retrieval. In Proceedings of the 25th International Conference on Multimedia Modeling 2019 (MMM2019). Lecture Notes in Computer Science, vol 11295, Springer International Publishing, Cham, 13 pages.
Jakub Lokoc, Werner Bailer, and Klaus Schoeffmann. 2018. What is the Role of Similarity for Known-Item Search at Video Browser Showdown? In Proceedings of the SISAP 2018 – 11th International Conference on Similarity Search and Applications, Lecture Notes on Computer Science (LNCS), Vol. 11223, Springer International Publishing, 96-104.
Stefan Petscharnig, Klaus Schöffmann, Jenny Benois-Pineau, Souad Chaabouni and Jörg Keckstein. 2018. Early and Late Fusion of Temporal Information for Classification of Surgical Actions in Laparoscopic Gynecology. In Proceedings of the 31st IEEE CBMS International Symposium on Computer-Based Medical Systems. IEEE, Los Alamitos, CA, USA, 6 pages, 369-374.
Andreas Leibetseder, Stefan Petscharnig, Manfred Jürgen Primus, Sabrina Kletz, Bernd Münzer, Klaus Schoeffmann, and Jörg Keckstein. 2018. Lapgyn4: a dataset for 4 automatic content analysis problems in the domain of laparoscopic gynecology. In Proceedings of the 9th ACM Multimedia Systems Conference (MMSys ’18). ACM, New York, NY, USA, 357-362.
Klaus Schoeffmann, Mario Taschwer, Stephanie Sarny, Bernd Münzer, Manfred Jürgen Primus, and Doris Putzgruber. 2018. Cataract-101: video dataset of 101 cataract surgeries. In Proceedings of the 9th ACM Multimedia Systems Conference (MMSys ’18). ACM, New York, NY, USA, 421-425.
Andreas Leibetseder and Klaus Schoeffmann. 2018. Extracting and Using Medical Expert Knowledge to Advance in Video Processing for Gynecologic Endoscopy. In Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval (ICMR ’18). ACM, New York, NY, USA, 485-488.
Sabrina Kletz, Andreas Leibetseder, and Klaus Schoeffmann. 2018. Evaluation of Visual Content Descriptors to Support AVS Tasks at the Video Browser Showdown. In Proceedings of the 24th International Conference on Multimedia Modeling 2018 (MMM2018). Lecture Notes in Computer Science, vol 10704, Springer, Cham, 203-215.
Manfred J. Primus, Doris Putzgruber-Adamitsch, Mario Taschwer, Bernd Münzer, Yosuf El-Shabrawi, Laszlo Böszörmenyi, and Klaus Schoeffmann. 2018. Frame-Based Classification of Operation Phases in Cataract Surgery Videos. In Proceedings of the 24th International Conference on Multimedia Modeling 2018 (MMM2018). Lecture Notes in Computer Science, vol 10704, Springer, Cham, 241-253.
Best Paper Candidate
Bernd Münzer, Manfred J. Primus, Sabrina Kletz, Stefan Petscharnig, and Klaus Schoeffmann. 2017. Static vs. Dynamic Content Descriptors for Video Retrieval in Laparoscopy. In Proceedings of the IEEE International Symposium on Multimedia 2017 (ISM2017), IEEE, Los Alamitos, CA, 216-223.
Christian Beecks, Sabrina Kletz and Klaus Schoeffmann. 2017. Large-Scale Endoscopic Image and Video Linking with Gradient-based Signatures. In Proceedings of the Third IEEE International Conference on Multimedia Big Data (BigMM). IEEE, Los Alamitos, CA, USA, 17-21.
Stefan Petscharnig and Klaus Schoeffmann. 2017. Deep Learning of Shot Classification in Gynecologic Surgery Videos. In Proceedings of the 23rd International Conference on Multimedia Modeling (MMM2017). Springer, Cham, LNCS 10132, 702-713.
Marco A. Hudelist and Klaus Schoeffmann. 2017. An Evaluation of Video Browsing on Tablets with the ThumbBrowser. In Proceedings of the 23rd International Conference on Multimedia Modeling (MMM2017). Springer, Cham, LNCS 1033, 89-100.
Bernd Münzer, Klaus Schoeffmann, and Laszlo Böszörmenyi. 2016. Domain-Specific Video Compression for Long-term Archiving of Endoscopic Surgery Videos. In Proceedings of the 29th IEEE International Symposium on Computer-Based Medical Systems (CBMS 2016), IEEE, Los Alamitos, CA, USA, 312-317.
Klaus Schoeffmann, Christian Beecks, Mathias Lux, Merih Seran Uysal, and Thomas Seidl. 2016. Content-based Retrieval in Videos from Laparoscopic Surgery. In Proceedings of SPIE 9786, Medical Imaging 2016: Image-Guided Procedures, Robotic Interventions, and Modeling, 97861V, San Diego, CA, USA, 10 pages.
Klaus Schoeffmann, Marco A. Hudelist, Bonifaz Kaufmann, and Kevin Chromik. 2016. Interactive Search in Video: Navigation With Flick Gestures vs. Seeker-Bars. In Proceedings of the 22nd International Conference on MultiMedia Modelling 2016 (MMM 2016), Miami, FL, USA, Lecture Notes on Computer Science (LNCS), Vol. 9516, Springer International Publishing, 370-381.
Christian Beecks, Klaus Schoeffmann, Mathias Lux, Merih Seran Uysal, and Thomas Seidl. 2015. Endoscopic Video Retrieval: A Signature-based Approach for Linking Endoscopic Images with Video Segments. In Proceedings of the IEEE International Symposium on Multimedia 2015 (ISM 2015), IEEE, Los Alamitos, CA, USA, 33-38.
Klaus Schoeffmann and Lukas Burgstaller. 2015. Scrubbing Wheel: An Interaction Concept to Improve Video Content Navigation on Devices with Touchscreens. In Proceedings of the IEEE International Symposium on Multimedia 2015 (ISM 2015), IEEE, Los Alamitos, CA, USA, 351-356.
Marco A. Hudelist, Klaus Schoeffmann, and Qing Xu. 2015. Improving Interactive Known-Item Search in Video with the Keyframe Navigation Tree. In Proceedings of the 21st International Conference on MultiMedia Modelling 2015 (MMM 2015), Sydney, Australia, Lecture Notes on Computer Science (LNCS), Vol. 8935, Springer International Publishing, 306-317.
Claudiu Cobarzan, Marco A. Hudelist, Klaus Schoeffmann, and Manfred J. Primus. 2015. Mobile Image Analysis: Android vs. iOS. In Proceedings of the 21st International Conference on MultiMedia Modelling 2015 (MMM 2015), Sydney, Australia, Lecture Notes on Computer Science (LNCS), Vol. 8936, Springer International Publishing, 99-110.
Klaus Schoeffmann. 2014. The Stack-of-Rings Interface for Large-Scale Image Browsing on Mobile Touch Devices. In Proceedings of the 22nd ACM international conference on Multimedia (MM ’14). ACM, New York, NY, USA, 1097-1100.
Xiaoxiao Luo, Qing Xu, Mateu Sbert, Klaus Schoeffmann. 2014. F-Divergences Driven Video Key Frame Extraction. In Proceedings of the IEEE International Conference on Multimedia & Expo (ICME 2014), IEEE, Los Alamitos, CA, USA, 6 pages.
Bernd Münzer, Klaus Schoeffmann, Laszlo Böszörmenyi, JF Smulders, and Jack J. Jakimowicz. 2014. Investigation of the Impact of Compression on the Perceptional Quality of Laparoscopic Videos. In Proceedings of the 27th IEEE International Symposium on Computer-Based Medical Systems (CBMS 2014), IEEE, Los Alamitos, CA, USA, 153-158.
Marco A. Hudelist, Claudiu Cobârzan, and Klaus Schoeffmann. 2014. OpenCV Performance Measurements on Mobile Devices. In Proceedings of International Conference on Multimedia Retrieval (ICMR ’14). ACM, New York, NY, USA, page 479, 4 pages.
Claudiu Cobarzan and Klaus Schoeffmann. 2014. How do Users Search with Basic HTML5 Video Players? In Proceedings of the 20th International Conference on MultiMedia Modeling (MMM2014), Springer International Publishing, LNCS 8325, 109-120.
Best Poster Paper
Marco A. Hudelist, Klaus Schoeffmann and David Ahlström. 2013. Evaluation of Image Browsing Interfaces for Smartphones and Tablets. In Proceedings of the 2013 IEEE International Symposium on Multimedia (ISM 2013), IEEE, Los Alamitos, CA, USA, 1-8.
Bernd Münzer, Klaus Schoeffmann and Laszlo Böszörmenyi. 2013. Relevance Segmentation of Laparoscopic Videos. In Proceedings of the 2013 IEEE International Symposium on Multimedia (ISM 2013), IEEE, Los Alamitos, CA, USA, 84-91.
Klaus Schoeffmann and Claudiu Cobarzan. 2013. An Evaluation of Interactive Search with Modern Video Players. In Proceedings of the 2013 IEEE International Conference on Multimedia and Expo (ICME 2013), IEEE, Los Alamitos, CA, USA, 1-4.
Bernd Münzer, Klaus Schoeffmann and Laszlo Böszörmenyi. 2013. Improving Encoding Efficiency of Endoscopic Videos by using Circle Detection based Border Overlays. In Proceedings of the 2013 IEEE International Conference on Multimedia and Expo (ICME 2013), IEEE, Los Alamitos, CA, USA, 4 pages.
Bernd Münzer, Klaus Schoeffmann and Laszlo Böszörmenyi. 2013. Detection of Circular Content Area in Endoscopic Videos. In Proceedings of the 26th International Symposium on Computer-Based Medical Systems (CBMS 2013), IEEE, Los Alamitos, CA, USA, 534-536.
Werner Bailer, Klaus Schoeffmann, David Ahlström, Wolfgang Weiss and Manfred Del Fabro. 2013. Interactive Evaluation of Video Browsing Tools. In Proceedings of the 19th International Conference on Multimedia Modeling (MMM 2013), LNCS 7732, Springer, Heidelberg, Germany, 81-91.
David Ahlström, Marco A. Hudelist, Klaus Schoeffmann, and Gerald Schaefer. 2012. A user study on image browsing on touchscreens. In Proceedings of the 20th ACM international conference on Multimedia (MM ’12). ACM, New York, NY, USA, 925-928.
Klaus Schoeffmann, David Ahlström and Laszlo Böszörmenyi. 2012. 3D Storyboards for Interactive Visual Search. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2012), IEEE, Los Alamitos, CA, USA, 848-853.
Klaus Schoeffmann and Laszlo Boeszoermenyi. Video Sequence Identification in TV Broadcasts, in Proceedings of the 17th International Multimedia Modeling Conference (MMM 2011), Taipei, Taiwan, 2011, pp. 129-139
Manfred del Fabro, Klaus Schoeffmann, and Laszlo Boeszoermenyi. Instant Video Browsing: A Tool for Fast Nonsequential Hierarchical Video Browsing, in HCI in Work and Learning, Life and Leisure, Vol. 6389, Lecture Notes in Computer Science, G. Leitner, M. Hitz, and A. Holzinger (Eds.), Klagenfurt, Austria, 2010, pp. 443-446
Klaus Schoeffmann, Mario Taschwer, and Laszlo Boeszoermenyi. 2010. The video explorer: a tool for navigation and searching within a single video based on fast content analysis. In Proceedings of the first annual ACM SIGMM conference on Multimedia systems (MMSys ’10). ACM, New York, NY, USA, 247-258.
Klaus Schoeffmann, Mathias Lux, Mario Taschwer, and Laszlo Boeszoermenyi. Visualization of Video Motion in Context of Video Browsing, in Proceedings of the IEEE International Conference on Multimedia and Expo, New York, USA, 2009, pp. 568-661
Klaus Schoeffmann, Mathias Lux, and Laszlo Boeszoermenyi. A Novel Approach for Fast and Accurate Commercial Detection in H.264/AVC Bit Streams Based on Logo Identification, in Proceedings of the 15th International Multimedia Modeling Conference (MMM 2009), Sophia-Antipolis, France, 2009, pp. 119-127
Klaus Schoeffmann and Laszlo Boeszoermenyi. Fast Segmentation of H.264/AVC Bitstreams for On-Demand Video Summarization, in Proceedings of the 14th International Multimedia Modeling Conference (MMM 2008), Kyoto, Japan, 2008, pp. 265-276
Klaus Schoeffmann, Markus Fauster, Oliver Lampl, and Laszlo Boeszoermenyi. An Evaluation of Parallelization Concepts for Baseline-Profile Compliant H.264/AVC Decoders, in Proceedings of the 13th International Euro-Par Conference (EuroPar 2007), Rennes, France, 2007, pp. 782-791