- From: Jan Oberhauser <jan@link.fish>
- Date: Sat, 7 Oct 2017 11:12:43 +0200
- To: Piero Savastano <piero.savastano@gmail.com>
- Cc: "schema.org Mailing List" <public-schemaorg@w3.org>, Ed Summers <ehs@pobox.com>, Dan Brickley <danbri@google.com>
- Message-ID: <CADXeypQnbVFCK4n3GvrNW3uKNuiVVCiaM2fWYX18j-eV4_fKew@mail.gmail.com>
Ah yes, it looks as if they made some changes. The player seems to appear much faster than before.

Knowing now that they add the schema.org data depending on the user-agent, I could at least solve my problem. As long as I override it for that request with something else like 'asdf', I get the data I need. So thanks for the help!

However, it would still be interesting to know why they do that.

blue skies

Jan Oberhauser
Founder link.fish

link.fish UG (haftungsbeschränkt)
Wilhelm-Kuhr-Str. 43
13359 Berlin

email | jan@link.fish
web | https://link.fish <http://link.fish>

Registergericht AG Charlottenburg, HRB 171276 B
Geschäftsführer: Jan Oberhauser

On Fri, Oct 6, 2017 at 8:23 PM, Piero Savastano <piero.savastano@gmail.com> wrote:

> Maybe they are refactoring some code?
>
> On Oct 6, 2017 20:01, "Jan Oberhauser" <jan@link.fish> wrote:
>
>> Ah yes, the same for Chromium:
>> curl -H "User-Agent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Ubuntu Chromium/59.0.3071.109 Chrome/59.0.3071.109 Safari/537.36" --silent https://www.youtube.com/watch?v=CHSIYGP0l_w | grep itemprop
>>
>> Interestingly, that worked till a few weeks ago. I crawl all pages with
>> Chromium to get the pages rendered properly and until now never had any
>> problems with that. I wonder why they would do that. Do they want to save
>> the few bytes of HTML and a few additional CPU cycles? At least that's the
>> only advantage I can think of.
>>
>> blue skies
>>
>> Jan Oberhauser
>> Founder link.fish
>>
>> link.fish UG (haftungsbeschränkt)
>> Wilhelm-Kuhr-Str. 43
>> 13359 Berlin
>>
>> email | jan@link.fish
>> web | http://link.fish
>>
>> Registergericht AG Charlottenburg, HRB 171276 B
>> Geschäftsführer: Jan Oberhauser
>>
>> On Fri, Oct 6, 2017 at 7:41 PM, Ed Summers <ehs@pobox.com> wrote:
>>
>>> > On Oct 6, 2017, at 12:00 PM, Ed Summers <ehs@pobox.com> wrote:
>>> >
>>> >> On Oct 6, 2017, at 11:58 AM, Dan Brickley <danbri@google.com> wrote:
>>> >>
>>> >> On 6 October 2017 at 16:56, Ed Summers <ehs@pobox.com> wrote:
>>> >>> This seems to indicate otherwise?
>>> >>>
>>> >>> curl --silent https://www.youtube.com/watch?v=CHSIYGP0l_w | grep itemprop
>>> >
>>> > What are they investigating? There appears to be microdata being
>>> > served.
>>>
>>> My mistake. If you tell curl to pretend to be Firefox, no microdata is
>>> returned.
>>>
>>> curl --header "User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:55.0) Gecko/20100101 Firefox/55.0" https://www.youtube.com/watch?v=CHSIYGP0l_w | grep itemprop
>>>
>>> ...yields nothing
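
For anyone who wants to repeat the check from the thread in a script rather than with curl and grep, here is a minimal Python sketch. It only uses the standard library. The helper names (`build_request`, `itemprops`, `fetch_itemprops`) are made up for this example and are not part of any tool mentioned above. It assumes the microdata appears as plain `itemprop="..."` attributes in the served HTML, which is what the grep in the thread looks for. Whether the site still varies its markup by user-agent is not something this sketch can tell you in advance.

```python
import re
import urllib.request

# Matches itemprop="name" or itemprop='name' (assumption: quoted attribute values,
# which is the same thing the `grep itemprop` in the thread would surface).
ITEMPROP_RE = re.compile(r'\bitemprop\s*=\s*["\']([^"\']+)["\']', re.IGNORECASE)


def build_request(url, user_agent):
    """Build a GET request that overrides the User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def itemprops(html):
    """Return the itemprop names found in an HTML string, in document order."""
    return ITEMPROP_RE.findall(html)


def fetch_itemprops(url, user_agent, timeout=10.0):
    """Fetch `url` with the given User-Agent and list the itemprop names served."""
    with urllib.request.urlopen(build_request(url, user_agent), timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return itemprops(resp.read().decode(charset, errors="replace"))


# Example usage (performs network requests, so it is left commented out):
#
# url = "https://www.youtube.com/watch?v=CHSIYGP0l_w"
# for ua in ("asdf",
#            "Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:55.0) "
#            "Gecko/20100101 Firefox/55.0"):
#     print(repr(ua), len(fetch_itemprops(url, ua)))
```

Comparing the counts for 'asdf' and for a browser user-agent string is the scripted equivalent of the two curl commands quoted above. A regular expression is used instead of a full HTML parser only to stay close to what grep does.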
Received on Saturday, 7 October 2017 09:13:39 UTC