Convert a video with content type video/quicktime to video/mp4 using ffmpeg

1.2k Views Asked by At

I have a strange problem. I start with a video file recored with an iPhone. The output is video.mov. I convert this to mp4 using ffmpeg:

ffmpeg -i video.mov -vcodec h264 -acodec aac video.mp4

If I use a tool like exiftool to show the metadata the output for content-type is video/mp4 all seems to be correct. But my backend, which receives the video, uses a library called Apache Tika to extract the content type and the result here is: video/quicktime.

Scala code to retrieve the content type from the file:

// byteArray: Array[Byte]

val tika        = new TikaConfig()
val metadata    = new Metadata()
val contentType = tika.getDetector.detect(TikaInputStream.get(byteArray), metadata)

BTW: If I use a video editing program (DaVinci Resolve in my case) I can load the video, adjust settings and explicitly set the codec to mp4, export the video and then the resulting file works and Apache Tika extract the desired content-type. This is a workaround, but it does not scale and I like to automate this process via ffmpeg.

I'm stuck here. Can anyone give me a hint what I can do?

Versions:

tika: 1.22
ffmpeg: 4.4

ffmpeg log:

$ffmpeg -i video.mov -vcodec h264 -acodec aac video.mp4
ffmpeg version 4.4 Copyright (c) 2000-2021 the FFmpeg developers
  built with Apple clang version 12.0.0 (clang-1200.0.32.29)
  configuration: --prefix=/usr/local/Cellar/ffmpeg/4.4_2 --enable-shared --enable-pthreads --enable-version3 --cc=clang --host-cflags= --host-ldflags= --enable-ffplay --enable-gnutls --enable-gpl --enable-libaom --enable-libbluray --enable-libdav1d --enable-libmp3lame --enable-libopus --enable-librav1e --enable-librubberband --enable-libsnappy --enable-libsrt --enable-libtesseract --enable-libtheora --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxml2 --enable-libxvid --enable-lzma --enable-libfontconfig --enable-libfreetype --enable-frei0r --enable-libass --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenjpeg --enable-libspeex --enable-libsoxr --enable-libzmq --enable-libzimg --disable-libjack --disable-indev=jack --enable-avresample --enable-videotoolbox
  libavutil      56. 70.100 / 56. 70.100
  libavcodec     58.134.100 / 58.134.100
  libavformat    58. 76.100 / 58. 76.100
  libavdevice    58. 13.100 / 58. 13.100
  libavfilter     7.110.100 /  7.110.100
  libavresample   4.  0.  0 /  4.  0.  0
  libswscale      5.  9.100 /  5.  9.100
  libswresample   3.  9.100 /  3.  9.100
  libpostproc    55.  9.100 / 55.  9.100
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from 'video.mov':
  Metadata:
    major_brand     : qt
    minor_version   : 0
    compatible_brands: qt
    creation_time   : 2021-06-24T11:19:37.000000Z
    com.apple.quicktime.location.accuracy.horizontal: 65.000000
    com.apple.quicktime.location.ISO6709: +52.5554+013.3772+042.268/
    com.apple.quicktime.make: Apple
    com.apple.quicktime.model: iPhone 8
    com.apple.quicktime.software: 14.4.2
    com.apple.quicktime.creationdate: 2021-06-24T13:19:36+0200
  Duration: 00:00:09.83, start: 0.000000, bitrate: 7954 kb/s
  Stream #0:0(und): Video: hevc (Main) (hvc1 / 0x31637668), yuv420p(tv, bt709), 1920x1080, 7810 kb/s, 30.01 fps, 30 tbr, 600 tbn, 600 tbc (default)
    Metadata:
      rotate          : 90
      creation_time   : 2021-06-24T11:19:37.000000Z
      handler_name    : Core Media Video
      vendor_id       : [0][0][0][0]
      encoder         : HEVC
    Side data:
      displaymatrix: rotation of -90.00 degrees
  Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 44100 Hz, mono, fltp, 96 kb/s (default)
    Metadata:
      creation_time   : 2021-06-24T11:19:37.000000Z
      handler_name    : Core Media Audio
      vendor_id       : [0][0][0][0]
  Stream #0:2(und): Data: none (mebx / 0x7862656D), 0 kb/s (default)
    Metadata:
      creation_time   : 2021-06-24T11:19:37.000000Z
      handler_name    : Core Media Metadata
  Stream #0:3(und): Data: none (mebx / 0x7862656D), 0 kb/s (default)
    Metadata:
      creation_time   : 2021-06-24T11:19:37.000000Z
      handler_name    : Core Media Metadata
  Stream #0:4(und): Data: none (mebx / 0x7862656D), 34 kb/s (default)
    Metadata:
      creation_time   : 2021-06-24T11:19:37.000000Z
      handler_name    : Core Media Metadata
Stream mapping:
  Stream #0:0 -> #0:0 (hevc (native) -> h264 (libx264))
  Stream #0:1 -> #0:1 (aac (native) -> aac (native))
Press [q] to stop, [?] for help
[libx264 @ 0x7fbe08009400] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2
[libx264 @ 0x7fbe08009400] profile High, level 4.0, 4:2:0, 8-bit
[libx264 @ 0x7fbe08009400] 264 - core 163 r3060 5db6aa6 - H.264/MPEG-4 AVC codec - Copyleft 2003-2021 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00
Output #0, mp4, to 'video.mp4':
  Metadata:
    major_brand     : qt
    minor_version   : 0
    compatible_brands: qt
    com.apple.quicktime.creationdate: 2021-06-24T13:19:36+0200
    com.apple.quicktime.location.accuracy.horizontal: 65.000000
    com.apple.quicktime.location.ISO6709: +52.5554+013.3772+042.268/
    com.apple.quicktime.make: Apple
    com.apple.quicktime.model: iPhone 8
    com.apple.quicktime.software: 14.4.2
    encoder         : Lavf58.76.100
  Stream #0:0(und): Video: h264 (avc1 / 0x31637661), yuv420p(tv, bt709, progressive), 1080x1920, q=2-31, 30 fps, 15360 tbn (default)
    Metadata:
      encoder         : Lavc58.134.100 libx264
      creation_time   : 2021-06-24T11:19:37.000000Z
      handler_name    : Core Media Video
      vendor_id       : [0][0][0][0]
    Side data:
      cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A
      displaymatrix: rotation of -0.00 degrees
  Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 44100 Hz, mono, fltp, 69 kb/s (default)
    Metadata:
      creation_time   : 2021-06-24T11:19:37.000000Z
      handler_name    : Core Media Audio
      vendor_id       : [0][0][0][0]
      encoder         : Lavc58.134.100 aac
frame=  295 fps= 18 q=-1.0 Lsize=    9980kB time=00:00:09.82 bitrate=8323.6kbits/s speed=0.614x
video:9884kB audio:85kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.103401%
[libx264 @ 0x7fbe08009400] frame I:6     Avg QP:21.84  size: 50301
[libx264 @ 0x7fbe08009400] frame P:217   Avg QP:24.23  size: 35435
[libx264 @ 0x7fbe08009400] frame B:72    Avg QP:24.97  size: 29582
[libx264 @ 0x7fbe08009400] consecutive B-frames: 66.8%  0.7%  4.1% 28.5%
[libx264 @ 0x7fbe08009400] mb I  I16..4:  6.7% 88.9%  4.4%
[libx264 @ 0x7fbe08009400] mb P  I16..4:  4.2% 32.9%  0.5%  P16..4: 49.6%  5.0%  2.3%  0.0%  0.0%    skip: 5.5%
[libx264 @ 0x7fbe08009400] mb B  I16..4:  0.9%  6.9%  0.0%  B16..8: 51.8%  7.7%  0.8%  direct:12.2%  skip:19.6%  L0:50.1% L1:46.9% BI: 3.0%
[libx264 @ 0x7fbe08009400] 8x8 transform intra:87.7% inter:88.8%
[libx264 @ 0x7fbe08009400] coded y,uvDC,uvAC intra: 59.2% 41.2% 0.9% inter: 38.3% 57.3% 0.0%
[libx264 @ 0x7fbe08009400] i16 v,h,dc,p: 15% 23%  8% 53%
[libx264 @ 0x7fbe08009400] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 16% 14% 29%  7%  8%  7%  7%  6%  6%
[libx264 @ 0x7fbe08009400] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 23% 18% 18%  9% 11%  7%  7%  4%  3%
[libx264 @ 0x7fbe08009400] i8c dc,h,v,p: 60% 20% 18%  1%
[libx264 @ 0x7fbe08009400] Weighted P-Frames: Y:3.2% UV:2.3%
[libx264 @ 0x7fbe08009400] ref P L0: 54.2% 10.7% 22.2% 12.5%  0.4%
[libx264 @ 0x7fbe08009400] ref B L0: 75.5% 19.6%  4.9%
[libx264 @ 0x7fbe08009400] ref B L1: 88.7% 11.3%
[libx264 @ 0x7fbe08009400] kb/s:8234.01
[aac @ 0x7fbe0802e000] Qavg: 446.889
0

There are 0 best solutions below