Commit graph

12770 commits

Author SHA1 Message Date
Yen Chi Hsuan
d19093bd50 Merge pull request #12906 from Tithen-Firion/clean-html-fix
[utils] Fix inconsistent output of clean_html
2017-04-29 15:58:45 +08:00
Yen Chi Hsuan
24eb7c2578
[xtube] Fix extraction with non-standard JSON 'sources'
Closes #12734

Thanks @paulguy for the fix!
2017-04-29 15:55:08 +08:00
Sergey M․
e7db6759e4
[downloader/external] Properly handle live stream downloading cancellation (closes #8932) 2017-04-29 04:33:35 +07:00
Sergey M․
b364c87c42
[tvplayer] Fix extraction (closes #12908) 2017-04-29 03:46:08 +07:00
Tithen-Firion
edd9221cd2 [utils] Fix inconsistent output of clean_html
`\s` in Python 2.x doesn't match unicode whitespace characters by
default
2017-04-28 17:34:27 +02:00
Sergey M․
bc8a2ea071
release 2017.04.28 2017-04-28 18:30:03 +07:00
Remita Amine
20783b8b50 [aenetworks] fix extraction for shows with single season 2017-04-28 12:04:56 +01:00
Remita Amine
bf2a5555c0 [go] add support for Disney, DisneyJunior and DisneyXD show pages 2017-04-28 09:48:52 +01:00
Remita Amine
fb8e8b2d16 [adobepass] use geo verification headers for all requests 2017-04-28 09:48:52 +01:00
Yen Chi Hsuan
b62985a9a5
[youtube] Recognize another HTML5 player URL (#12885) 2017-04-28 16:25:04 +08:00
Yen Chi Hsuan
e31fed95b4
[youtube] Recognize new locale-based player URLs (fixes #12885) 2017-04-28 15:48:30 +08:00
Tithen-Firion
3fd0f70f6a [cbslocal] Update test 2017-04-28 04:26:59 +07:00
Tithen-Firion
33c62efc32 [collegerama] Update tests 2017-04-28 04:00:49 +07:00
Tithen-Firion
6b4ddd336c [afreecatv] Fix title extraction 2017-04-28 04:00:15 +07:00
Tithen-Firion
c12b4b80f8 [archiveorg] Update test 2017-04-28 03:48:32 +07:00
Tithen-Firion
064fafe932 [appleconnect] Update test 2017-04-28 03:47:25 +07:00
Tithen-Firion
ac1a5b9a12 [audioboom] Update test 2017-04-28 03:36:28 +07:00
Tithen-Firion
a15777491a [atresplayer] Update test 2017-04-28 03:32:25 +07:00
Tithen-Firion
d8571dd6bf [bleacherreport] Update tests 2017-04-28 03:28:26 +07:00
Sergey M․
c0fa4245ce
[downloader/fragment] Remove assert for resume_len when no fragments downloaded
This may be incorrect due some header (e.g. flv header in f4m downloader)
2017-04-28 03:26:19 +07:00
Tithen-Firion
8814ae42bc [beeg] Update test 2017-04-28 03:14:11 +07:00
Tithen-Firion
0f63dc2402 [bandcamp] Update test 2017-04-28 03:13:12 +07:00
Tithen-Firion
dde97ea8da [canalc2] Update test 2017-04-28 03:07:42 +07:00
Sergey M․
c89b49f743
[extractor/common] Add manifest_url for explicit group rendition formats 2017-04-28 03:00:14 +07:00
Tithen-Firion
6f4a888416 [br] Update test 2017-04-28 02:53:11 +07:00
Tithen-Firion
f5edd7ae51 [clipfish] Update test 2017-04-28 02:51:30 +07:00
Tithen-Firion
c95e2b5911 [cbc] update test cases 2017-04-27 18:07:07 +02:00
Sergey M․
ff99fe529e
Don't list master m3u8 playlists in format list (closes #12832) 2017-04-27 21:53:17 +07:00
Tithen-Firion
76c1951036 [appletrailers] update test cases 2017-04-27 10:04:21 +02:00
Lucas M
e8bfe2a946 [streamable] Add support for new embedded URL schema 2017-04-26 23:39:53 +07:00
Sergey M․
3dc8b61b7f
[arte:+7] Relax _VALID_URL (closes #12837) 2017-04-26 01:55:29 +07:00
Sergey M․
a82f41841d
release 2017.04.26 2017-04-26 00:06:12 +07:00
Sergey M․
ac9c69ace7
[extractor/common] Improve jwplayer regex 2017-04-25 23:46:05 +07:00
Sergey M․
85f6de25e4
[downloader/fragment] Clarify current_fragment's index and mark as experimental 2017-04-25 23:33:35 +07:00
Yen Chi Hsuan
01cb57016f
[iqiyi] Fix extraction of Yule videos 2017-04-25 22:23:57 +08:00
Sergey M․
290f64dbaa
[downloader/fragment] Improve .ytdl format and start documenting 2017-04-24 23:50:20 +07:00
Sergey M․
adb4b03cd5
[downloader/fragment] Don't process ytdl file when it's not needed yet 2017-04-24 23:05:56 +07:00
Sergey M․
0eee52f34b
Introduce --keep-fragments 2017-04-24 03:09:08 +07:00
Sergey M․
d3f0687cf7
[downloader/fragment] Use temp file for current fragment 2017-04-24 02:54:17 +07:00
Sergey M․
a4d6cf970c
[YoutubeDL] Fix output template for missing timestamp (closes #12796) 2017-04-24 00:50:39 +07:00
Sergey M․
3019cb0c99
[extractor/common] Rephrase comment 2017-04-23 11:52:07 +07:00
Sergey M․
ddd258f922
[test_InfoExtractor] Add m3u8 parsing test for NAME attribute in EXT-X-STREAM-INF tag 2017-04-23 11:49:57 +07:00
Sergey M․
07ad0cf34f
[vidio] Improve and sort formats 2017-04-23 11:48:51 +07:00
Sergey M․
9c99bef704
[extractor/common] Use float for scaled tbr 2017-04-23 11:33:49 +07:00
Remita Amine
ffbc8386b9 [brightcove] match only video elements with data-video-id attribute 2017-04-22 22:26:20 +01:00
Remita Amine
4abdba643c [downloader/fragment] remove unused code 2017-04-22 18:19:47 +01:00
Remita Amine
3e0304fe6e [downloader/fragment] use the documented names for fragment progress_hooks fields 2017-04-22 16:42:24 +01:00
Yen Chi Hsuan
fbf56be213
[iqiyi] Fix playlist detection (#12504) 2017-04-22 22:11:37 +08:00
Yen Chi Hsuan
54f54fcca7
[socks] Report errors elegantly when credentails are required but missing
In some non-standard implementations, the server may respond AUTH_USER_PASS
even if's not listed in available authentication methods. (it should
respond AUTH_NO_ACCEPTABLE per standards)
2017-04-22 21:48:41 +08:00
Yen Chi Hsuan
facfd79f9a
[azubu] Remove extractor as the site is gone (closes #12813) 2017-04-22 21:20:25 +08:00
Yen Chi Hsuan
3110bb937d
[porn91] Fix extraction (closes #12814) 2017-04-22 21:16:36 +08:00
Sergey M․
cb2520802d
[extractor/common] Improve m3u8 extraction (closes #12211)
* Extract m3u8 parsing to separate method
* Improve rendition groups extraction
* Build stream name according stream GROUP-ID
* Ignore reference to AUDIO group without URI when stream has no CODECS
+ Add test coverage for parsing m3u8 from #11507, #11995, #12211 and twitch vod
2017-04-22 07:01:00 +07:00
Sergey M․
f779958250
[vidzi] Fix extraction (closes #12793) 2017-04-21 23:37:06 +07:00
Remita Amine
8abc7dca39 [amp] extract error message(closes #12795) 2017-04-20 05:16:41 +01:00
Remita Amine
ea0c2f219c [downloader/fragment] use a general file to store fragment download context 2017-04-19 18:53:15 +01:00
Sergey M․
481ef51e23
[brightcove] PEP 8 2017-04-19 21:47:03 +07:00
Remita Amine
5b995f713b [utils] add support for ttml styles 2017-04-19 14:38:40 +01:00
Remita Amine
75a2485407 [fragment,hls,f4m,dash,ism] improve fragment downloading
- resume immediately
- no need to concatenate segments and decrypt them on every resume
- no need to save temp files for segments

and for hls downloader:
- no need to download keys for segments that already downloaded
2017-04-19 11:46:07 +01:00
Remita Amine
58f6ab72ed [odnoklassniki] update tests 2017-04-19 00:16:55 +01:00
Sergey M․
2dc48df5bc
[xfileshare] Add support for gorillavid.com and daclips.com (closes #12776) 2017-04-18 23:58:37 +07:00
Sergey M․
18848d226a
[instagram] Fix extraction (closes #12777) 2017-04-18 22:40:26 +07:00
Sergey M․
a32a9a7ef5
[extractor/common] Add support multiple getters in try_get 2017-04-18 22:39:58 +07:00
Sergey M․
bae1404893
[extractor/common] Add support for video of WebPage context in _json_ld (closes #12778) 2017-04-18 22:21:38 +07:00
Yen Chi Hsuan
06d0ad9a4e
[brightcove] Support URLs with bcpid instead of playerID
Fixes #12482
2017-04-18 23:04:22 +08:00
Sergey M․
f631b55791
[brightcove] Fix _extract_url (closes #12782) 2017-04-18 21:46:25 +07:00
Remita Amine
bf1b87cd91 [common] Relax JWPlayer regex and remove duplicate urls(#12768) 2017-04-17 08:48:24 +01:00
Remita Amine
1c35b3da44 [odnoklassniki] extract m3u8 formats 2017-04-16 21:27:08 +01:00
Sergey M․
16a09aefe3
release 2017.04.17 2017-04-17 00:46:32 +07:00
Sergey M․
ab87c26009
[extractor/generic] Add test for #12761 2017-04-17 00:36:59 +07:00
Sergey M․
91bc57e4c5
[limelight] Add support for channels and channels lists embedded using LimelightEmbeddedPlayerFlash (#12761) 2017-04-17 00:33:47 +07:00
Sergey M․
e5d39886ec
[limelight] Improve embeds extraction (closes #12761)
* Move extraction code to extractor
* Add extraction for LimelightEmbeddedPlayerFlash embeds
* Extract multiple video
2017-04-17 00:23:16 +07:00
Sergey M․
751c89a27d
[itv] Extract series metadata 2017-04-16 23:19:20 +07:00
Sergey M․
c2d7d76efd
[itv] Fix rtmp formats (#12759) 2017-04-16 23:15:24 +07:00
Sergey M․
f67177cae8
[itv] Use native hls 2017-04-16 21:52:45 +07:00
Sergey M․
5935ef3c5d
[itv] Lower preference for rtmp formats (closes #12759) 2017-04-16 21:52:07 +07:00
remitamine
72950c4dce Merge pull request #12752 from triple-j/go90_improvements_pull_request
[go90] Improve extraction
2017-04-15 23:48:13 +02:00
Jeremie J. Jarosh
334f41e0d8 [go90] Improve extraction
- add metadata for 'series', 'episode', 'season', 'season_id',
  'season_number', and 'episode_number'
- integrate series title into the title
- extract subtitles  (fallback to `vtt` if the subtitle file type
  detection fails as that is the most likely extension)
2017-04-15 16:32:00 -05:00
Sergey M․
022250a594
release 2017.04.16 2017-04-16 01:49:02 +07:00
Sergey M․
d35dc344af
[YoutubeDL] Apply expand_path after output template substitution
os.path.expandvars translates '%%' into '%' making output template invalid.
Before: '%%(ext)s' -(expand path)-> '%(ext)s' -(outtmpl subst.)-> 'mp4'
After: '%%(ext)s' -(outtmpl subst.)-> '%(ext)s' -(expand path)-> '%(ext)s'
2017-04-16 01:39:07 +07:00
Sergey M․
0563f7ac6e
[YoutubeDL] Propagate overridden metadata to IE results of type url (closes #11163) 2017-04-16 00:57:36 +07:00
Sergey M․
413c1f8e2f
[extractor/generic] Extract RSS entries as url_transparent (#11163) 2017-04-16 00:57:32 +07:00
Sergey M․
8068296276
[streamango] Improve extraction (closes #12643) 2017-04-15 21:50:51 +07:00
Marvin Ewald
4db79fa1bc
[streamango] Add extractor 2017-04-15 21:50:46 +07:00
Sergey M․
b2a19e3829
[wsj] Improve and modernize (closes #12558) 2017-04-15 20:59:05 +07:00
John Hawkinson
3266d08af2
[wsj:article] Add extractor 2017-04-15 20:58:57 +07:00
Sergey M․
0254f93b08
[brightcove] Validate ambiguous embeds' URLs (closes #9163, closes #12005, closes #12178, closes #12480) 2017-04-15 20:31:49 +07:00
John Hawkinson
40158f55c9
[brightcove] Relax video tag embeds extraction
BrightcoveNewIE.extract_urls(): Handle player data when it is all in
attributes of the <video> tag, and also the id is
data-brightcove-video-id not data-video-id.

Add test to generic extractor, note HEAD gives 404, no checksum computed(?).
2017-04-15 20:31:43 +07:00
Sergey M․
3dfceb286c
[udemy] Add support for react rendition (closes #12744) 2017-04-15 03:26:19 +07:00
Sergey M․
57e2cb6c3a
[curiositystream] PEP 8 2017-04-15 02:33:26 +07:00
Sergey M․
6f1b2374af
[canalplus] PEP 8 2017-04-15 02:33:12 +07:00
Sergey M․
d24bd4ffdb
release 2017.04.15 2017-04-15 01:39:33 +07:00
Sergey M․
34c3bbe1bd
[youku] Keep old fileid extraction code as fallback (#12741) 2017-04-15 01:28:02 +07:00
xuhaomin
e5106ba402 [youku] Fix fileid extraction 2017-04-14 13:23:02 -05:00
Sergey M․
38d2f8325f
release 2017.04.14 2017-04-14 00:31:49 +07:00
Sergey M․
95728fda70
[aenetworks] Add support for specials (closes #12723) 2017-04-13 23:39:55 +07:00
Remita Amine
3f7409f745 [hbo] extract m3u8 formats 2017-04-13 12:28:22 +01:00
Remita Amine
b2fff30817 [go90] Add new extractor(closes #10127) 2017-04-13 12:28:22 +01:00
Remita Amine
f59746372a [downloader/hls] add basic support for EXT-X-BYTERANGE tag(#10955) 2017-04-13 12:28:22 +01:00
Yen Chi Hsuan
34d98cc411
[tv2hu] Fix invalid escape sequence on Py3.6 2017-04-13 16:52:52 +08:00
Remita Amine
40fcba5edb improve coding style 2017-04-12 20:38:43 +01:00