13 minute read

相似视频检索: 相似指的是通过修改原视频的到新的视频;
相关资料:检索概述 · 图像检索资源 · 音频检索资源

video fingerprinting · video DNA · video signature · video hash · video content-based watermarking;
video retrieval; near-duplicate video detection · video copy detection · video forgery detection · video content identification; video vreification · video authentication ;

发现这方面论文很凌乱,关键字都有很多个版本;每个方向也不多

不同的应用场景下,是否应该用相同的策略

1 综述

1.1 重复视频检测

  1. Large-scale near-duplicate web video search: Challenge and opportunity
    2009 paper

  2. Million-scale Near-duplicate Video Retrieval System
    2011-11-28 paper
    $\bullet \bullet$
    使用颜色特征; K-Means++ 选择特征,倒排加速搜索;

  3. Spatio-temporal video copy detection
    2011 paper

  4. An Exploration based on Multifarious Video Copy Detection Strategies
    2013 paper

  5. Near-Duplicate Video Retrieval: Current Research and Future Trends
    2013-08 paper
    使用了一款视频编辑软件;阐述了全局特征的劣势——忽略了中间物体;

  6. Survey on Web Scale Based Near Duplicate Video Retrieval
    2016 paper

  7. 视频拷贝检测方法综述
    2017-04 复旦·计算机 姜育刚 paper
    全方位的概述;

  8. GVoS: A General System for Near-Duplicate Video Related Applications on Storm
    2017 paper

  9. A Systematic Review of Near Duplicate Video Retrieval Techniques
    2018-05-27 paper

    太简单,就是一个粗糙的博客;没有思考性的东西,也缺乏大量实验比对;图表模糊,不合格;

1.2 拷贝检测

  1. Video Copy Detection: a Comparative Study
    2007 paper

  2. A Survey On Video Forgery Detection
    2015-03-03 paper

  3. Recent advances in content based video copy detection
    2016-10-28 paper

  4. A review on robust video copy detection
    2019 springer
    $\bullet \bullet$ VCD Review

1.3 视频签名

  1. Video Content Identification Using Video Signature: Survey
    2017-07 paper

1.4 基于内容的视频检索

  1. Content-Based Image Retrieval at the end of the early years
    2000 paper
    汇总了各种方法,展示了遗留的问题;

  2. Content based Video Retrieval: A Survey
    2015-01 paper

  3. Content-Based Video Retrieval in Historical Collections of the German Broadcasting Archive
    2017-02-13 paper
    提供了丰富的视频检索流程;

  4. FIVR: Fine-grained Incident Video Retrieval
    2018-09-11 paper
    发布了新数据集 FIVR-200K;

  5. Interactive video retrieval in the age of deep learning
    ICMR 2019 toturial 2019 paper

1.5 跨模态视频检索

  1. Find and Focus: Retrieve and Localize Video Events with Natural Language Queries
    ECCV 2018 2018 paper | home

  2. Deep Learning for Video Retrieval by Natural Language
    2019-10-25 paper

1.6 版权保护

  1. DETECTING DIGITAL COPYRIGHT VIOLATIONS ON THE INTERNET
    1999 paper

1.7 视频指纹

  1. Introduction to Video Fingerprinting
    2009 paper

  2. Video fingerprinting for copy identification: From research to industry applications
    2009 paper

  3. The Core of Video Fingerprinting: Examples of Feature
    2009 paper
    $\bullet \bullet$

2 理论

3 技术点

3.1 关键帧提取

keyframe extract · shot boundary detection;

3.2 特征提取

包括低层特征:颜色、边缘、纹理、动作;
中级特征:运动轨迹、物体颜色;
高级特征:;

feature extract · video fingerprinting · video DNA · video signature · video hash · video content-based watermarking;

3.2.1 帧特征

3.2.2 视频特征

  1. Robust video signature based on ordinal measure
    2004 paper
    重采样应对帧率变化;顺序度量特征,固定滑窗计算相似度;

  2. Video Shot Characterization∗
    paper

  3. A Rotation Invariant Descriptor for Robust Video Copy Detection
    2007 paper

  4. Content-based video fingerprinting method for fast key generation and retrieval

  5. VIDEO COPY DETECTION USING FINGERPRINTING WITH FAST IMAGE PROCESSING
    2016-04 paper

  6. Deep Video Hashing
    2016 paper
    $\bullet \bullet$ Deep Video Hashing

  7. Video Copy Detection Based On Temporal Contextual Hashing
    2016
    时序二进制 hash;

  8. Unsupervised Deep Video Hashing with Balanced Rotation
    IJCAI 2017 2017 paper
    $\bullet \bullet$
    无监督特征提取;

  9. Deep Hashing with Category Mask for Fast Video Retrieval
    2017-12-22 美图 paper | blog
    $\bullet \bullet$ Hash Mask
    使用网络提取视频特征:先用 backbone 网络针对每一帧提取特征,然后将多个特征进行融合,得到固定维度的特征就是视频特征;
    得到特征后使用阈值进行量化,得到 Hash 码;然后计算 hamming 距离;
    • 该方案的可扩展性有待深究(阈值的设定、特征融合对于输入帧数的限制);
    • 文章默认未考虑视频剪辑的情况;因为只适应从视频采样出固定的几帧,那么就只能完成检索任务,没有办法定位到时间;且如果采样到不相关的帧,对检索结果影响也较大;
    • 文章对于如何采样并没有说明;感觉工作不是很严谨;整体方案应是只停留在理想环境;
  10. Convolutional Hashing for Automated Scene Matching
    2018-02-09 paper

  11. A Video Database Management System for Advancing Video Database Research
    2018-02-09 paper
    cnn hash,设计了新的 loss 函数;

  12. Finding Near-Duplicate Videos in Large-Scale Collections
    2019

  13. A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis
    2019 paper

  14. Extracting camera-based fingerprints for video forensics
    CVPR 2019 workshop 2019 paper

3.2.3 多帧融合
3.2.3.1 聚类

  1. Towards effective indexing for very large video sequence database
    对帧特征进行聚类;

  2. Comprehensive Feature-based Robust Video Fingerprinting Using Tensor Model
    2016-01-27 paper
    $\bullet \bullet$
    特征融合及索引;

3.2.3.2 降维

  1. UQLIPS: A Real-time Near-duplicate Video Clip Detection System
    2007 paper
    使用 PCA 对多帧视频特征降维得到视频特征;

  2. Bounded coordinate system indexing for real-time video clip search
    2009 paper
    使用 BCS(PCA+) 对多帧视频特征降维得到视频特征;

  3. Real-time Large Scale Near-duplicate Web Video Retrieval

3.2.3.3 Fisher vector

  1. SHOT AGGREGATING STRATEGY FOR NEAR-DUPLICATE VIDEO RETRIEVAL
    2015 paper
    镜头聚合;

3.2.3.4 稀疏编码

  1. Compact CNN Based Video Representation for Efficient Video Copy Detection
    2016-12
    $\bullet \bullet$
    CNN 提取特征,然后稀疏编码到固定长度;

3.2.3.4 VLAD

  1. Aggregating local descriptors into a compact image representation
    2010 paper

3.2.3.5 其他

  1. Video histogram: A novel video signature for efficient Web video duplicate detection
    2007 paper
    相关直方图;

  2. Practical elimination of near-duplicates from Web video search
    2007 paper
    CC_Web_VIDEO 数据集,基于关键帧,一阶段用颜色,累积直方图,过滤不相似的视频;而阶段用 PCA-SIFT + 滑窗;

  3. CNN Features Off-the-Shelf: An Astounding Baseline for Recognition
    2014-03-23 paper
    $\bullet \bullet$

  4. A discriminative CNN video representation for event detection
    2014-11-14 paper

  5. Hierarchical feature fusion hashing for near-duplicate video retrieval
    2016 paper
    $\bullet \bullet$

  6. Detecting near-duplicate videos by aggregating features from intermediate CNN layers
    2016-08 paper | tensorflow
    $\bullet \bullet$

  7. Near-duplicate video retrieval by aggregating intermediate cnn layers
    2017 paper | caffe
    $\bullet \bullet$ NDVR CNN
    流程:CNN 提取特征(每层都提);K-Means 生成码本(最佳值是 1000);Apache Spark 助力计算;
    对比了 AlexNet、VGG、GoogleNet 在视频检索中的表现;并且与传统方法做了比较;效果更好;

    实验很细致,也给出了一个相对完整的视频检索流程;只是没有考虑剪辑的情况;

  8. Effective Multiple Feature Hashing for Large-Scale Near-Duplicate Video Retrieval
    2013
    $\bullet \bullet$ MFH
    监督学习进行特征 hash;

3.2.4 压缩域

无需解码视频,根据编码信息提取特征,可应对裁剪等拷贝操作;

  1. Bit Rate-based H. 264 Video Copy Detection
    2018-01 paper
    $\bullet \bullet$ h.264
    流程:镜头检测(I 帧判断)——压缩域提取特征(P 帧比特率)——比对($\chi^2$ + 编辑距离);
    结果:只能与部分算法持平;且压缩域极易受剪辑攻击;

    这个方法不可行;关于 I 帧找镜头的方法还不错;

3.2.5 其他

  1. Submodular Video Hashing: A Unified Framework Towards Video Pooling and Indexing
    2012 paper

  2. Unsupervised t-Distributed Video Hashing and Its Deep Hashing Extension
    $\bullet \bullet$ t-UDH
    2016-12 paper

  3. Joint Audio-Video Fingerprint Media Retrieval Using Rate-Coverage Optimization
    2016-09-05 paper
    $\bullet \bullet$ Rate Coverage

  4. Unsupervised Deep Video Hashing via Balanced Code for Large-Scale Video Retrieval
    2018 paper
    $\bullet \bullet$ UDVH

3.3  索引

3.3.1 综述

  1. Review of Image and Video Indexing Techniques
    1997 semanticscholar

  2. Image and video indexing in the compressed domain
    1997 paper
    另一个名字是 A critical evaluation of image and video indexing techniques in the compressed domain
    针对频域索引技术进行了比对,包括傅立叶变换,余弦变换,Karhunen-Loeve变换,子带和小波的变换域技术;还有矢量量化和分形的空间域技术和基于运动矢量的时间索引技术;

  3. A state-of-the-art review on multimodal video indexing
    2002 paper

  4. Multimodal video indexing: A review of the state-of-the-art
    2005 paper
    多模态索引;

  5. A Survey on Visual Content-Based Video Indexing and Retrieval
    2011 paper

  6. Multimedia Indexing and Retrieval Techniques: A Review
    2012 paper

3.3.2 树形结构

  1. Prover: Probabilistic video retrieval using the Gauss-tree
    2007 paper

3.3.3 VA-File

  1. A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces
    1998 paper

  2. Trading quality for time with nearest-neighbor search
    2000 paper

  3. iDistance: An Adaptive B+-Tree Based Indexing Method for Nearest Neighbor Search
    2007 paper
    iDistance;

  4. Indexing High-Dimensional Data in Dual Distance Spaces: A Symmetrical Encoding Approach
    2008 paper

  5. Online Near-Duplicate Video Clip Detection and Retrieval: An Accurate and Fast System
    2009

  6. Towards effective indexing for very large video sequence database
    2005 paper
    iDistance;

3.3.4 Hash

  1. A Posteriori Multi-Probe Locality Sensitive Hashing
    2008 paper
    LSH: 提升准确度;

  2. Quality and Efficiency in High Dimensional Nearest Neighbor Search
    2009 paper
    LSH: 改映射函数;

  3. An Image-Based Approach to Video Copy Detection With Spatio-Temporal Post-Filtering
    2010 paper
    基于关键点,LSH 改进,词袋模型;

  4. Large-scale video copy retrieval with temporal-concentration SIFT
    2016
    改进 sift;LSH:位置敏感 hash,解决高维特征检索慢的问题;

  5. Stochastic Multiview Hashing for Large-Scale Near-Duplicate Video Retrieval
    2016-09-15 paper
    $\bullet \bullet$ stochastic hash
    镜头边界检测,汉明距离,map;多视角随机 hash 提升检索精度和速度;

3.3.5 倒排

  1. Video Google: a text retrieval approach to object matching in videos
    2003 paper

  1. Scalable Detection of Partial Near-Duplicate Videos by Visual-Temporal Consistency
    2009 paper
    $\bullet \bullet$ Network Flow
    倒排,针对弱几何一致性做了改进;使用了图对齐;

  1. Real-time Large Scale Near-duplicate Web Video Retrieval
    2010 paper
    倒排,map+准确度+召回率;LBP 直方图相交;词袋模型+时序特征;

3.4 时间对齐

temporal alignment · local alignment

3.4.1 滑窗

  1. Compact video description for copy detection with precise temporal alignment
    ECCV 2010 2010 paper
    分层索引,霍夫投票;

3.4.2 树形结构

  1. Searching for repeated video sequences
    2007 paper

3.4.3 动态规划

  1. Detection of video sequences using compact signatures
    2006 paper

3.4.4 图对齐

最大流

  1. Effective and Efficient Query Processing for Video Subsequence Identification
    2007 paper
    $\bullet \bullet$ MSM SMSM

  2. Scalable Detection of Partial Near-Duplicate Videos by Visual-Temporal Consistency

  3. Efficient mining of multiple partial near-duplicate alignments by temporal network
    2010

  4. Detection and location of near-duplicate video sub-clips by finding dense subgraphs
    2011 paper

  5. Identification of video subsequence using bipartite graph matching
    2011 paper
    $\bullet \bullet$ Graph bipartite

  6. A two-step video subsequence identification based on bipartite graph matching 加了个命中函数;

  7. Video Hyperlinking: Libraries and Tools for Threading and Visualizing Large Video Collection
    2012 paper | tool

  8. Video Copy Detection Based on Path Merging and Query Content Prediction
    2015-10
    基于图模型的路径合并;

3.4.5 DL

  1. Temporal Cycle-Consistency Learning
    CVPR 2019 2019-04-16 Google & Deepmind paper | project | tensorflow-official

  1. Temporal Attentive Alignment for Video Domain Adaptation
    CVPR 2019 workshop 2019-05-26 paper | pytorch-official
    Temporal Attentive Alignment for Large-Scale Video Domain Adaptation 是同一团队;

  1. Temporal Attentive Alignment for Large-Scale Video Domain Adaptation
    ICCV 2019 oral 2019-07-30 paper | pytorch-official | reddit
    Temporal Attentive Alignment for Video Domain Adaptation 是同一团队;

3.4.6 蛋白质序列

  1. Searching for Near-Duplicate Video Sequences from a Scalable Sequence Aligner
    2013-11-19 paper

3.4.7 多项式逼近

只能应对滑窗试匹配;

polynomial approximation

  1. A novel scheme for fast and efficient video sequence matching using compact signatures
    2000 paper
    特征使用的 DCT 直方图,针对片段匹配;

3.4.8 其他

  1. Video sequence matching based on temporal ordinal measurement
    2007 paper
    提取了时间维度特征;

  2. Fast Visual Retrieval Using Accelerated Sequence Matching
    2010 paper

  3. Efficient video copy detection via aligning video signature time series
    2012
    基于倾斜的时间对齐;对齐时引入了帧的插入、删除和替换策略;

  4. Frame filtering and path verification for improving video copy detection
    2013 paper
    $\bullet \bullet$
    针对定位精度和就三效率做了处理;

  5. Block based video alignment with linear time and space complexity
    ICIP 2016-09 paper
    $\bullet \bullet$ DDWT
    线性时间和空间复杂度;

  6. Energy based fast event retrieval in video with temporal match kernel
    ICIP 2017 2017-09 paper | ppt
    乘积量化;

  7. Temporal Matching Kernel with Embedded Stability-Sensitive Filter
    2017-12

  8. Circulant Temporal Encoding for Video Retrieval and Temporal Alignment
    2015-06-08 paper
    $\bullet \bullet$ Circulant
    使用复数处理特征,以达到精确匹配;

  9. Burst-survive Temporal Matching Kernel with Fibonacci Periods
    ICASSP 2019 2019 paper | python

  10. Neighborhood Preserving Hashing for Scalable Video Retrieval
    ICCV 2019 2019 清华深研院 paper

3.5 相似度度量

序列相似度度量字符串匹配

sequence matching · similarity measure
longest common sub-sequence

3.5.1 全局匹配

3.5.1.1 编辑距离

  1. A distance measure for video sequence similarity matching
    1998 香港中文 paper
    distance measure · edit distance · sequence-tosequence matching · video string
    贡献:在传统 edit distance 上融入了 vstring 的概念,并加入了三种操作来适应序列间的匹配;
    说明:vstring 负责序列数据的组织,edit distance 负责非等长数据之间的匹配;
    算法特点:对空间特征的鲁棒性要求较高;空间和时间复杂度较高;
    疑问
    • vstring 怎么实现的;
    • edit distance 怎么实现的;

      感觉 ED 系列的方法都是针对本身就相似的特征,使用串匹配的方法详细计算其相似度,所以更适合用来计算数据之间的相似性,而不适合用在拷贝检测相关问题中;因为拷贝检测问题是为了找出该视频副本,也就是副本和原视频越相似越好;而 ED 则是很客观地给出副本和原视频的不相似程度;

  2. Comparison of distance measures for video copy detection
    2001 IBM paper
    $\bullet \bullet$ IBM
    比较了几种特征度量的方法;
    特征:直方图(RGB,HSV,梯度)、局部梯度、不变矩、差分;
    度量:Hash(直方图相交,分块),L2(max·min);

    针对图像相似度计算做了简单汇总(更像是个博客); 拿视频当幌子,根本没提序列匹配及时间定位问题;

  3. Spatiotemporal sequence matching for efficient video copy detection
    2005 Semantic Scholar

  4. Beyond Distance Measurement: Constructing Neighborhood Similarity for Video Annotation
    2009-04 paper
    邻域度量;

  5. Visual word proximity and linguistics for semantic video indexing and near-duplicate retrieval
    2009 paper
    $\bullet \bullet$ proximity
    针对关键帧检索任务(通过文字或图像检索),在单帧的局部特征上用词袋模型,合成帧的全局特征,然后用 EMD 算法计算距离;

  6. An image-based near-duplicate video retrieval and localization using improved edit distance
    2017
    改进 ED, 可以过滤不相似的帧;

3.5.2 局部匹配

3.5.2.1 沃特曼

  1. Video copy detection by fast sequence matching
    2009 paper
    $\bullet \bullet$ Fast Sequence
    先前的相似度计算方法是基于关键帧的,没有考虑时间连续性;编辑距离 ED(最长公共子序列是 ED 的特例)虽然考虑了时序,但是他是针对全局序列的匹配,无法应对剪辑和拼接;文章就此提出了针对局部序列匹配的改进版 ED 算法;
    采样:固定时间采样(1s);
    特征:-VSMF-color(半全局特征,马尔可夫);
    相似度:一方面,整体框架使用了 史密斯·沃特曼算法,内有 ED 的概念;另一方面,替换操作定义为常量减去两帧之间的距离;
    \(\begin{align} v(q_i, r_i) &= c - d(q_i, r_i) \\ d(q_i, r_i) &= \chi^2 = \sum_j {\frac{(q_{ij} - r_{ij})^2}{r_{ij}}} \\ \end{align}\) $i$ 是帧号,$x_i$ 是一帧图像的特征向量,$x_{ij}$ 是特征向量中的一个值;

3.5.2.2 最优时间公共子序列
Optimal Temporal Common Subsequence

  1. Efficient and Effective State-based Framework for News Video Retrival
    2010 paper
    当作多维字符串处理;
    $\bullet \bullet$

  2. The Optimal Temporal Common Subsequence
    2010 library

  3. Multiscale video sequence matching for near-duplicate detection and retrieval
    2018-05-04 paper
    $\bullet \bullet$ MS-VSM
    多阶段检索;

3.5.2.3 DTW

  1. A Time Warping Based Approach for Video Copy Detection
    2006 paper
    $\bullet \bullet$ DTW
    文章认为之前的方法速度太慢,且没有考虑视频间序列变化问题;因此进行了如下改进:
    优化速度
    提取关键帧:拉氏变换 + 时序峰值;
    提取片段:基于关键帧,确定可用的起止位置;组合后得到可用的片段;
    提升准确度
    精确计算视频距离(时间差分):当前对应帧距离 + 关联帧距离(关联帧距离中的最小值);

    文章所说的 DTW 中的动态,也只是在候选片段上多跑几次,没有太大的理论意义;
    文章大量篇幅在讲关键帧,但是未曾引用相关论文,且 TW 模块也是;因此论文工作量不足,可信度和理论解释也一般;

3.6 多特征&多模态

  1. A MULTIMODAL VIDEO COPY DETECTION APPROACH WITH SEQUENTIAL PYRAMID MATCHING∗
    2011 paper

4 应用方向

4.1 重复视频检测

又叫视频拷贝检测,Content Based Copy Detection, CDBC: 作用等同于水印,主要用于版权保护;不同之处在与他是直接从视频本身提取一些特征;

video copy detection · video forgery detection · near-duplicate video detection · video tampering · video manipulation

  1. Real time repeated video sequence identification
    2003 paper

  2. Efficient Near-duplicate Detection and Sub-image Retrieval
    2004 paper
    哈希;

  3. Feature statistical retrieval applied to content-based copy identification
    2004 paper
    兴趣点签名,对打数据检索高效;

  4. Efficient and Effective Video Copy Detection Based on Spatiotemporal Analysis
    2007 paper

  5. Scalable mining of large video databases using copy detection
    2008 paper

  6. A Framework for Handling Spatiotemporal Variations in Video Copy Detection
    2008 rar

  7. A compact, effective descriptor for video copy detection
    2009 paper

  8. Video Copy Detection by Fast Sequence Matching
    2009 paper

  9. Scale-Rotation Invariant Pattern Entropy for Keypoint-based Near-Duplicate Detection
    2009 paper

  10. Realtime near duplicate elimination for web video search with content and context
    2009 paper

  11. A Robust and Fast Video Copy Detection System Using Content-Based Fingerprinting
    2011 paper

  12. Frame Fusion for Video Copy Detection
    2011 paper

  13. Multiple feature hashing for real-time large scale near-duplicate video retrieval
    2011-12 ppt
    发布了数据集 UQ_VIDEO

  14. Content Based Video Copy Detection: Issues and Practices
    2012 paper

  15. Implementation of a Content Based Video Copy Detection Using Spatio-Temporal Measure
    2012 paper
    基于镜头;或许并未考虑序列问题;

  16. Content-Based Video Copy Detection Benchmarking at TRECVID
    2014 paper

  17. VCDB: A Large-Scale Database for Partial Copy Detection in Videos
    2014 paper
    发布了数据集

  18. Detection and Localization of Video Copy-Move Forgery in Temporal and Spatial Domain
    2015 paper

  19. Pattern-Based Near-Duplicate Video Retrieval and Localization on Web-Scale Videos
    2015 paper
    $\bullet \bullet$
    基于模式的特征处理,过滤不相似的视频;

  20. Frame-level matching of near duplicate videos based on ternary frame descriptor and iterative refinement
    ICIP 2015 2015 paper

  21. Partial Copy Detection in Videos: A Benchmark and An Evaluation of Popular Methods
    2016-03-01 paper
    $\bullet \bullet$ TBD
    公开 VCDB 数据集,有 10 万条视频,9000 对;深度局部特征+词袋模型;

  22. Near-Duplicate Video Detection Based on an Approximate Similarity Self-Join Strategy
    2016-04 paper

  23. Near Duplicate Video Retrieval using Spatio Temporal Approach with Multifeature Mechanism
    2016-06 paper

  24. Near-Duplicate Video Retrieval with Deep Metric Learning
    ICCV 2017 workshop 2017 paper
    $\bullet \bullet$ DML

  25. Video Copyright Detection Using High Level Objects in Video Clip
    2017-12 paper
    $\bullet \bullet$ objects
    基于镜头找相似片段;

  26. A new technique for video copy-move forgery detection
    2017 paper

  27. A PatchMatch-based Dense-field Algorithm for Video Copy-Move Detection and Localization
    2017-03-14 paper
    发布了新的数据集;

  28. Autoencoder with recurrent neural networks for video forgery detection
    2017-08-29 paper

  29. TRECVID 2018: Benchmarking Video Activity Detection, Video Captioning and Matching, Video Storytelling Linking and Video Search
    2018-11-12 paper

  30. A Coarse-to-fine Deep Convolutional Neural Network Framework for Frame Duplication Detection and Localization in Forged Videos
    2018-11-27 paper
    $\bullet \bullet$ coarse-fine cnn

  31. Simple Yet Efficient Content Based Video Copy Detection
    2018-04-19 paper

  32. Capsule-forensics: Using Capsule Networks to Detect Forged Images and Videos
    2018-10-26 paper

  33. Geometrically robust video hashing based on ST-PCT for video copy
    2019-04-08

  34. Video tampering localisation using features learned from authentic content
    2019-01-11 paper
    $\bullet \bullet$ tampering localisation

  35. We Need No Pixels: Video Manipulation Detection Using Stream Descriptors
    ICML 2019 Worksop 2019-06-20 paper

  36. Partial-copy detection of non-simulated videos using learning at decision level
    2019
    $\bullet \bullet$ decision level

  37. SVD: A Large-Scale Short Video Dataset for Near-Duplicate Video Retrieval
    ICCV 2019 2019 抖音+南大 lambda paper
    $\bullet \bullet$ SVD
    发布了 SVD 数据集;

4.2 签名验证

video vreification

  1. Online Signature Verification Based on Writer Specific Feature Selection and Fuzzy Similarity Measure
    CVPR 2019 (Applications to Media Forensics) 2019-05-21 paper

4.3 基于内容的视频检索

CBVR: 用于相似场景检测;

content based video retrieval · video content identification

  1. Video sequence matching
    1998
    基于动作做的检索;并且进行了时间对齐;

  2. A Fully Automated Content-Based Video Search Engine Supporting Spatiotemporal Queries
    1998 paper

  3. Retrieval of News Video using Video Sequence Matching
    2005 paper

  4. Content based video retrieval systems
    2012-05-08 paper

  5. Content Based Video Retrieval
    2016-05 paper
    $\bullet \bullet$ CBVR

  6. Content-based Video Indexing and Retrieval Using Corr-LDA
    2016-02-27 paper

  7. Exploiting detected visual objects for frame-level video filtering
    2017 paper
    $\bullet \bullet$
    融入了目标检测和跟踪;

  8. Use What You Have: Video Retrieval Using Representations From Collaborative Experts
    BMVC 2019 2019-07-31 paper | pytorch-official

4.4 重定位

找到给定视频在目标视频中的位置;

video relocation

  1. AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
    CVPR 2018 2018-05-23 paper | AVA-dataset

  2. Video Re-localization
    ECCV 2018 2018-08-05 腾讯 AI Lab + 美国罗切斯特大学 paper | tensorflow-official
    重组 ActivityNet 数据集视频,生成了一个符合研究需求的新数据集,并提出一种交叉过滤的双线性匹配模型;

  3. Spatio-temporal Video Re-localization by Warp LSTM
    2019-05-10 paper

4.5 跨模态检索

multi-modal hashing · cross-modal retrieval
siam-network
Zero-Example Video Retrieval · Text-Video Retrieval
Video-Music Retrieval

年份 AAAI ICML NIPS CVPR ICCV ECCV
2019 2 0   1    
2018 0   0 0   0
2017 1     0 0  
2016 1     0    
2015 0     1 0  
2014 0     0    
2013 0     0 0  
  1. 一种多模态融合的网络视频相关性度量方法
  2. 多模态视频分析技术研究与应用
  3. 优酷在多模态内容理解上的研究及应用

4.5.1 跨模态

  1. Coupled CycleGAN: Unsupervised Hashing Network for Cross-Modal Retrieval
    AAAI 2019 2019-03-06 paper
    UCH: 为了解决跨模态散列问题,使用 GAN 将特征提取和哈希连接起来;

4.5.2 图片

4.5.2.1 行人重识别

4.5.2.2 商品检索

  1. Video2Shop: Exactly Matching Clothes in Videos to Online Shopping Images
    CVPR 2017 2018-04-14 西南交大、阿里巴巴 paper
    AsymNet: 用 FasterRCNN 检测出物体,然后进行比对;

  2. Asymmetric Spatio-Temporal Embeddings for Large-Scale Image-to-Video Retrieval
    BMVC 2018 2018 paper

4.5.3 文本

4.5.3.1 广告植入

根据输入的关键词,给出视频中与之呼应的情景片段;

4.5.4 音频

  1. Content-Based Video-Music Retrieval Using Soft Intra-Modal Structure Constraint
    2017-04-22 paper | 示例 | tensorflow-official

  2. Cross-modal Embeddings for Video and Audio Retrieval
    2018-01-07 paper | Youtube-8m-训练


TOP

附录

A 参考资料

  1. Cross-Modal Retrieval-paper_with_code
  2. InVID H2020 Project
  3. Multiple feature hashing for real-time large scale near-duplicate video retrieval-引用文献
  4. scinapse
  5. video-retrieval-github
  6. 爱奇艺视频版权保护技术与维权实践 | 视频回放 | 陈赫
    从「发布之前」、「分发播放」、「盗版追踪」、「维权处理」四个环节介绍爱奇艺在版权保护方面的相关技术以及具体应用情况;
  7. awesomeCVpapers : 特征提取;

B 报告

  1. ICIP 2017 Near-Duplicate Video Detection Exploiting Noise Residual Traces
  2. Introduction to video hashing-limu

C 数据集

重复视频检测:

名称 类型 数量(训练/测试) 说明 发布日期
TRECVID & data     著名的视频拷贝检测竞赛 2001~2019
CC_WEB_VIDEO     包含颜色亮度变换,画面编辑和拼接 2007
MuscleVCD ST1        
UQ_VIDEO   169,952 查询视频 400 个,检索视频 20 万;单个视频对应的检索视频最多 1000 个;
提取到的关键帧有 3,305,525 个;
共 15G;
2011
Copy-move forgeries dataset        
VCDB     8G 2014
FIVR   225,960/100   2019
AVA        
SVD 抖音开源 500,000 ICCV 2019 2019
         

D 项目

  1. ThreatExchange
    code | TMK + PDQF-相似视频识别 | PDQ-相似图片识别
  2. duplicate video search

E 比赛

  1. TREC Video Retrieval Evaluation: TRECVID

F 研究员

  1. OLIVES Researchgithub

Comments