Text this: Design of an Integrated Model for Video Summarization Using Multimodal Fusion and YOLO for Crime Scene Analysis