INTERACT FORUM

Author Topic: Properly decoded quotation marks in subtitles. Can it be done?  (Read 1029 times)

Sky King

  • Galactic Citizen
  • ****
  • Posts: 302
Properly decoded quotation marks in subtitles. Can it be done?
« on: February 10, 2024, 07:52:56 am »

I've noticed that subtitles are decoded by MC into a string of an odd character followed by the letters "quot;".  See attachment.

Is there a way to correct this from within MC or is it a problem with the subtitle information gleaned from the disc or other source?

Thank you.

Hendrik

  • Administrator
  • Citizen of the Universe
  • *****
  • Posts: 10935
Re: Properly decoded quotation marks in subtitles. Can it be done?
« Reply #1 on: February 12, 2024, 05:20:04 am »

Technically these are not supposed to even be in those subtitles, but we can try to handle them and replace them with normal characters. It should show up in MC32 in a future build.
~ nevcairiel
~ Author of LAV Filters

Sky King

  • Galactic Citizen
  • ****
  • Posts: 302
Re: Properly decoded quotation marks in subtitles. Can it be done?
« Reply #2 on: February 12, 2024, 06:09:26 am »

Looks like I'll have to upgrade to 32 sooner rather than later.  I believe those odd decodes are for quotation marks.

Thanks for taking a look at it.

lepa

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 2033
Re: Properly decoded quotation marks in subtitles. Can it be done?
« Reply #3 on: February 12, 2024, 08:42:41 am »

Probably ANSI-encoded subtitle text muxed/saved as UTF, or the other way around.
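
To illustrate that kind of mismatch, here is a rough Python sketch. It assumes "ANSI" means Windows-1252 (cp1252) and "UTF" means UTF-8; the helper names `misdecode` and `repair` are made up for this example and have nothing to do with how MC handles subtitles.

```python
def misdecode(text: str, actual: str = "utf-8", assumed: str = "cp1252") -> str:
    """Encode text with its real encoding, then decode it with the wrong one."""
    return text.encode(actual).decode(assumed, errors="replace")


def repair(garbled: str, actual: str = "utf-8", assumed: str = "cp1252") -> str:
    """Undo misdecode(); only works if no bytes were lost along the way."""
    return garbled.encode(assumed).decode(actual)


opening_quote = "\u201c"            # left curly double quote
garbled = misdecode(opening_quote)  # one character becomes three odd ones
print(ascii(garbled))               # ascii() keeps the output console-safe
print(repair(garbled) == opening_quote)
```

Note that a mismatch like this turns one quote into several odd characters, but it would not by itself produce the letters "quot;".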

zybex

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 2619
Re: Properly decoded quotation marks in subtitles. Can it be done?
« Reply #4 on: February 12, 2024, 09:01:02 am »

&quot; is XML or HTML encoding. Maybe subs saved from some web page.
Maybe subs that contain <b> <i> and similar HTML tags will have other symbols such as double quotes encoded too, as per HTML spec.
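
If you want to clean up a subtitle file yourself in the meantime, something like this Python sketch would turn the entities back into normal characters. It uses the standard library's `html.unescape`; the function name `decode_entities` and the sample lines are only illustrations, and this is not necessarily what MC will do internally.

```python
import html


def decode_entities(line: str) -> str:
    """Replace HTML/XML character references such as &quot; with the real characters."""
    return html.unescape(line)


# Hypothetical subtitle lines, made up for this example.
samples = [
    "&quot;Where are you going?&quot;",
    "<i>Tom &amp; Jerry</i>",
]

for sample in samples:
    print(decode_entities(sample))
```

Formatting tags like <i> are left alone; only the character references are decoded. If the leading ampersand was already mangled into some other character, this will not catch it.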

Sky King

  • Galactic Citizen
  • ****
  • Posts: 302
Re: Properly decoded quotation marks in subtitles. Can it be done?
« Reply #5 on: February 12, 2024, 09:15:36 am »

Very possible.  There is a dot SRT file associated with this particular movie.

Hendrik

  • Administrator
  • Citizen of the Universe
  • *****
  • Posts: 10935
Re: Properly decoded quotation marks in subtitles. Can it be done?
« Reply #6 on: February 12, 2024, 09:55:22 am »

Quote from: zybex
Maybe subs that contain <b> <i> and similar HTML tags will have other symbols such as double quotes encoded too, as per HTML spec.

SRT specifically supports b, i, u and some limited font tags; it's not really meant to include other HTML features.
But I imagine it was likely converted from WebVTT or some other format that supports/requires more markup.
~ nevcairiel
~ Author of LAV Filters

Sky King

  • Galactic Citizen
  • ****
  • Posts: 302
Re: Properly decoded quotation marks in subtitles. Can it be done?
« Reply #7 on: February 13, 2024, 07:59:18 am »

Update:

I went back and analyzed my description of the issues I saw.  The improperly decoded opening and closing quotation marks were in a file, pulled legally from a paid site of course, whose subtitles were described as English [CC] [eng](tx3g).  When I replaced that subtitle track with the .SRT file, there were no issues at all.