Full Text Available


Investigating the virtual directing strategies of a virtual cinematographer in an automatic lecture video post-processing system

As recording technology improves and becomes more affordable, many learning institutions are using lecture recording to make lessons more persistent and accessible. Statically mounted 4K cameras are now cheaper than PTZ cameras which makes them a desirable alternative for lecture recordings. Unfortu...


Bibliographic Details
Main Author:	Khatieb, Mohamed Tanweer
Other Authors:	Marais, Patrick
Format:	Thesis
Language:	English
Published:	Department of Computer Science 2024
Subjects:	Computer Science

_version_	1867613251077931008
access_status_str	Open Access
author	Khatieb, Mohamed Tanweer
author2	Marais, Patrick
author_browse	Khatieb, Mohamed Tanweer Marais, Patrick
author_facet	Marais, Patrick Khatieb, Mohamed Tanweer
author_sort	Khatieb, Mohamed Tanweer
collection	Thesis
description	As recording technology improves and becomes more affordable, many learning institutions are using lecture recording to make lessons more persistent and accessible. Statically mounted 4K cameras are now cheaper than PTZ cameras which makes them a desirable alternative for lecture recordings. Unfortunately, 4K resolution videos are very large, posing a problem for storage and streaming - the file size for a 45-60 minute lecture video in 4K can exceed 2GB. Many students cannot afford the bandwidth required to stream such large files. Furthermore, since static 4K cameras do not move, they require a wide-angle view of the venue in order to capture as much of the front of the venue as possible. This view is much too zoomed out for viewers to see the details, such as writing on the boards and the presenter's facial expressions, captured by the 4K resolution. This dissertation investigates an approach to post-processing these 4K lecture videos to reduce the file size and emphasise lecture details such as lecture motion and board/screen usage. This is done using scene tracking data (generated via a third-party front-end) which a Virtual Cinematographer (VC) uses to make decisions about which areas to crop from each 4K frame in the original video. The VC then positions and sizes the cropping windows in such a way that the resultant, cropped video resembles one recorded by a human camera operator. This is accomplished using cinematographic heuristics to inform its decision-making. The VC uses scene analysis algorithms to determine how the environment changes as time progresses in the video. By dividing the video into “chunks” (equivalent to “scenes” in traditional cinematography) based on context, the VC is able to maintain stable shots with consistent framing to avoid jittery and disorienting footage. 
These contextual chunks are determined by comparing the trajectory of the presenter with the manner in which the features on the board regions change over time. After the chunks are established, the VC creates transitions between them while avoiding any changes to the framing inside each chunk. The final output is a JSON file containing the cropping coordinates for each frame in the video for a third-party video cropping application to use when producing the final video. We performed a user evaluation of the VC to measure user satisfaction with the resulting output videos and how successful it was at following its heuristics. The VC succeeded in following the major heuristics such that viewers were satisfied with the output based on the framing of the presenter and the content on the boards, transition stability and smoothness of motion, and transition frequency with the VC only changing shots when necessary.
format	Thesis
id	oai:open.uct.ac.za:11427/40692
institution	University of Cape Town (South Africa)
language	eng
last_indexed	2026-06-10T12:33:10.259Z
license_str	Not specified — see source repository
provenance_str_mv	Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository
publishDate	2024
publishDateRange	2024
publishDateSort	2024
publisher	Department of Computer Science
publisherStr	Department of Computer Science
record_format	dspace
source_str	UCTD — University of Cape Town Open Access Repository
spelling	oai:open.uct.ac.za:11427/40692 Investigating the virtual directing strategies of a virtual cinematographer in an automatic lecture video post-processing system Khatieb, Mohamed Tanweer Marais, Patrick Marquard, Stephen Marquard, Stephen Computer Science As recording technology improves and becomes more affordable, many learning institutions are using lecture recording to make lessons more persistent and accessible. Statically mounted 4K cameras are now cheaper than PTZ cameras which makes them a desirable alternative for lecture recordings. Unfortunately, 4K resolution videos are very large, posing a problem for storage and streaming - the file size for a 45-60 minute lecture video in 4K can exceed 2GB. Many students cannot afford the bandwidth required to stream such large files. Furthermore, since static 4K cameras do not move, they require a wide-angle view of the venue in order to capture as much of the front of the venue as possible. This view is much too zoomed out for viewers to see the details, such as writing on the boards and the presenter's facial expressions, captured by the 4K resolution. This dissertation investigates an approach to post-processing these 4K lecture videos to reduce the file size and emphasise lecture details such as lecture motion and board/screen usage. This is done using scene tracking data (generated via a third-party front-end) which a Virtual Cinematographer (VC) uses to make decisions about which areas to crop from each 4K frame in the original video. The VC then positions and sizes the cropping windows in such a way that the resultant, cropped video resembles one recorded by a human camera operator. This is accomplished using cinematographic heuristics to inform its decision-making. The VC uses scene analysis algorithms to determine how the environment changes as time progresses in the video. 
By dividing the video into “chunks” (equivalent to “scenes” in traditional cinematography) based on context, the VC is able to maintain stable shots with consistent framing to avoid jittery and disorienting footage. These contextual chunks are determined by comparing the trajectory of the presenter with the manner in which the features on the board regions change over time. After the chunks are established, the VC creates transitions between them while avoiding any changes to the framing inside each chunk. The final output is a JSON file containing the cropping coordinates for each frame in the video for a third-party video cropping application to use when producing the final video. We performed a user evaluation of the VC to measure user satisfaction with the resulting output videos and how successful it was at following its heuristics. The VC succeeded in following the major heuristics such that viewers were satisfied with the output based on the framing of the presenter and the content on the boards, transition stability and smoothness of motion, and transition frequency with the VC only changing shots when necessary. 2024-11-08T08:54:58Z 2024-11-08T08:54:58Z 2023 2024-09-03T09:33:36Z Thesis / Dissertation Masters Masters http://hdl.handle.net/11427/40692 eng application/pdf Department of Computer Science Faculty of Science
spellingShingle	Computer Science Khatieb, Mohamed Tanweer Investigating the virtual directing strategies of a virtual cinematographer in an automatic lecture video post-processing system
thesis_degree_str	Master's
title	Investigating the virtual directing strategies of a virtual cinematographer in an automatic lecture video post-processing system
title_full	Investigating the virtual directing strategies of a virtual cinematographer in an automatic lecture video post-processing system
title_fullStr	Investigating the virtual directing strategies of a virtual cinematographer in an automatic lecture video post-processing system
title_full_unstemmed	Investigating the virtual directing strategies of a virtual cinematographer in an automatic lecture video post-processing system
title_short	Investigating the virtual directing strategies of a virtual cinematographer in an automatic lecture video post-processing system
title_sort	investigating the virtual directing strategies of a virtual cinematographer in an automatic lecture video post processing system
topic	Computer Science
url	http://hdl.handle.net/11427/40692
work_keys_str_mv	AT khatiebmohamedtanweer investigatingthevirtualdirectingstrategiesofavirtualcinematographerinanautomaticlecturevideopostprocessingsystem
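
The description above says the VC holds the framing fixed inside each contextual chunk and writes per-frame cropping coordinates to a JSON file. The following is only a minimal sketch of that idea, not the thesis's implementation: the output crop size, the JSON field names (`frame`, `x`, `y`, `w`, `h`), the helper names, and the choice to centre each window on the mean presenter position are all assumptions made for illustration. It also omits the transitions between chunks that the description mentions.

```python
import json

FRAME_W, FRAME_H = 3840, 2160   # 4K UHD source frame
CROP_W, CROP_H = 1280, 720      # assumed output size (not taken from the thesis)


def crop_for_chunk(xs, ys):
    """Return one fixed crop window centred on the mean presenter position."""
    cx = sum(xs) / len(xs)
    cy = sum(ys) / len(ys)
    # Clamp the top-left corner so the window stays inside the source frame.
    x = int(min(max(cx - CROP_W / 2, 0), FRAME_W - CROP_W))
    y = int(min(max(cy - CROP_H / 2, 0), FRAME_H - CROP_H))
    return {"x": x, "y": y, "w": CROP_W, "h": CROP_H}


def crops_per_frame(track, chunks):
    """Build per-frame crop records.

    track  -- list of (x, y) presenter centres, one per frame (hypothetical input)
    chunks -- list of (start, end) half-open frame ranges (hypothetical input)
    """
    out = []
    for start, end in chunks:
        xs = [p[0] for p in track[start:end]]
        ys = [p[1] for p in track[start:end]]
        window = crop_for_chunk(xs, ys)
        # Every frame in the chunk reuses the same window, so the shot is stable.
        out.extend(dict(window, frame=i) for i in range(start, end))
    return out


# Tiny made-up example: the presenter stands left for 3 frames, then far right.
track = [(1000, 1200), (1010, 1200), (1020, 1200), (3700, 1200), (3710, 1200)]
chunks = [(0, 3), (3, 5)]
crops = crops_per_frame(track, chunks)
print(json.dumps(crops, indent=2))
```

Reusing a single window for every frame of a chunk is what keeps the shot steady in this sketch; a fuller version would interpolate between consecutive windows to produce the transitions.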


Similar Items