YouTube
YouTube videos and comments are discovered and collected through YouTube channel subscriptions and generic keyword searches.
Average volumes: 885,833 new videos and 23,677,166 new comments.
Videos
Field name | Field description | Data example |
video | The parent element that contains information about an individual video | "video": { |
id | An ID generated by the crawler to uniquely identify each video. | yt.FauOVXgJ0zU |
siteid | A unique text value that identifies the video site (e.g. youtube). | youtube |
site_videoid | An identifier created by the video site to uniquely identify each video | FauOVXgJ0zU |
title | The title of the video. | maaf jelek |
description | The description of the video. Not always included. | |
thumb_url_code | The URL of the video's thumbnail image. | |
tags | Tags that describe the content of the video. Not always included. | [] |
category | The category the video was assigned | People and Blogs |
published | The date/time the video was published (UTC) | 2024-08-13T13:11:01Z |
crawled | The date/time the video was harvested (UTC) | 2024-08-16T14:30:11Z |
duration | The length of the video in seconds | 17 |
videourl | The URL where the video was found. | |
author | The parent element that contains author information | "author": { |
author/name | The name of the author who submitted the video | Prinsca gaming |
author/site_authorid | An identifier created by the video site to uniquely identify each author | UCwmB2IO7xlu4fmTdH6Ul36A |
author/authorurl | The URL to the author's profile page | |
lang | The language code that identifies the language used in the video. | id |
langid | The Socialgist internal mapped ID for this language | 40 |
Comments
Field name | Field description | Data example |
comment | The parent element that contains information about an individual comment | "comment": { |
id | An ID generated by the crawler to uniquely identify each video or comment. | yt.Ugxb8X5GgmyTo0M24FF4AaABAg |
videoid | The ID of the video that the comment is associated with. BoardReader uses this as the thread ID. | yt.9IHwqdz8Xhw |
siteid | A unique text value that identifies the video site (e.g. youtube). | youtube |
site_commentid | An identifier created by the video site to uniquely identify each comment | UgxPGSMDXNWXumpJw8d4AaABAg |
title | Some video sites include a title for each comment. In most cases the title is the first part of the comment text. | অতীত এর ভুল থেকে শিক্ষা নিবেন। এ দেশ হতে আ লিগ শব্দ টা চিরতরে |
content | The text of the comment. | অতীত এর ভুল থেকে শিক্ষা নিবেন। এ দেশ হতে আ লিগ শব্দ টা চিরতরে কবর দিতে হবে জনগনের কাছে আইডল হতে হলে পরানতিক পর্যায়ে রিদয়ের ভিতর ডুকতে হলে জিয়াউর রহমানের আদর্শ উপলব্ধি করবেন। নির্বাচন নিয়ে মাথা….. |
published | The date/time the comment was published (UTC) | 2024-08-15T18:56:03Z |
crawled | The date/time the comment was harvested (UTC) | 2024-08-16T13:50:42Z |
videourl | The URL where the video was found. | |
commentsurl | A URL that displays all the comments for the video. For YouTube this is limited to the most recent 1,000 comments. | |
author | The parent element that contains information about the author of the comment | "author": { |
author/name | The comment author’s name or video account name | @kingtexzone5051 |
author/site_authorid | The video site specific id for the author | UCUb5_lzoVTJ67OienqU5FLQ |
author/authorurl | The URL of the comment author's profile. In some cases this can also be the author of the video. | |
author/profile_picture | The author profile avatar or picture. | |
lang | The language code that identifies the language used in the comment. | bn |
langid | The Socialgist internal mapped ID for this language | 8 |
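The two message shapes in the field tables can be told apart by their single top-level key (video or comment), and in the examples the crawler-generated id is the site-side ID with a yt. prefix. The following Python sketch relies on both observations; the prefix rule is inferred from the examples rather than documented as a guarantee, and the helper names message_type and site_id_from_crawler_id are hypothetical, not part of any published client library.

```python
import json

# Assumption: the "yt." prefix is inferred from the examples in the field tables.
ID_PREFIX = "yt."


def message_type(message: dict) -> str:
    """Return "video" or "comment" depending on the top-level key present."""
    for kind in ("video", "comment"):
        if kind in message:
            return kind
    raise ValueError("unrecognised message: expected a 'video' or 'comment' key")


def site_id_from_crawler_id(crawler_id: str) -> str:
    """Strip the assumed "yt." prefix to recover the site-side identifier."""
    if not crawler_id.startswith(ID_PREFIX):
        raise ValueError(f"unexpected ID format: {crawler_id!r}")
    return crawler_id[len(ID_PREFIX):]


# Abbreviated message built from the example values in the video field table.
raw = '{"video": {"id": "yt.FauOVXgJ0zU", "siteid": "youtube", "site_videoid": "FauOVXgJ0zU"}}'
message = json.loads(raw)
kind = message_type(message)
record = message[kind]
ids_match = site_id_from_crawler_id(record["id"]) == record["site_videoid"]
print(kind, ids_match)  # prints: video True
```

Raising on an unexpected prefix, rather than returning the input unchanged, makes it obvious if the inferred rule ever stops holding.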
SAMPLE MESSAGE TYPES
Video
{
"video": {
"id": "yt.MK6nBKODkVE",
"siteid": "youtube",
"site_videoid": "MK6nBKODkVE",
"title": "🟢 Thomas train exe vs Sonic the headgehog exe vs Siren Head vs Spider House Head 🌟 Who is best?",
"description": "🟢 Thomas train exe vs Sonic the headgehog exe vs Siren Head vs Spider House Head 🌟 Who is best? Tiles Hop EDM\n\n🎥🔥 Thanks for watching! Don't forget to leave a comment and be sure to watch to the end! 🙌💬🌟\n\n\n🎶 Welcome to the mesmerizing world of Adventure TilesHop! 🌟\n\nMy content uses some images and songs from other creators and brands to create music gaming videos for relaxation and entertainment. I have used these images and songs based on the fair use guidelines of Section 107 of the Copyright Act..\nThis content is intended for recreational and entertainment purposes only.\nThis is a music game. Please use headphones for the best experience.\n\nLike 👍\nComment ✍️\nSubscribe ✅\nShare 🙏\n\nFor more gaming content subscribe to Adventure TilesHop\n\n#tileshop #tileshopeveryday #coffindance #choochoocharles #sirenhead #mcqueeneater #thomastrainexe",
"thumb_url_code": "https://i.ytimg.com/vi/MK6nBKODkVE/default_live.jpg",
"tags": [
"tiles hop",
"tiles hop every day",
"choo choo charles tiles hop",
"lightning mcqueen tiles hop",
"skibidi toilet tiles hop",
"thomas train tiles hop",
"spider thomas tiles hop",
"coffin dance tiles hop",
"tiles hop edm rush",
"car eater tiles hop",
"sonic tiles hop",
"sonic exe tiles hop",
"death sonic tiles hop",
"siren head tiles hop",
"eater tiles hop",
"tiles hop song",
"coffin dance",
"sonic the hedgehog tiles hop",
"sonic exe coffin dance",
"siren head coffin dance",
"house head tiles hop",
"thomas train exe"
],
"category": "Gaming",
"published": "2024-08-16T11:28:45Z",
"crawled": "2024-08-16T15:47:20Z",
"duration": "3143",
"videourl": "https://www.youtube.com/watch?v=MK6nBKODkVE",
"author": {
"name": "Adventure TilesHop",
"site_authorid": "UCiW8EXSscOMq2pQ-lGsYMsw",
"authorurl": "http://youtube.com/channel/UCiW8EXSscOMq2pQ-lGsYMsw"
},
"lang": "en",
"langid": "22"
}
}
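In the video sample above, duration and langid arrive as JSON strings and the timestamps are UTC strings ending in Z. A minimal Python sketch of normalising those fields follows; the Video dataclass and the parse_utc and parse_video helpers are hypothetical names chosen for illustration, and the sketch assumes every message uses the same timestamp layout as the sample.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import List


def parse_utc(value: str) -> datetime:
    """Parse a timestamp such as 2024-08-16T11:28:45Z into an aware datetime."""
    return datetime.strptime(value, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


@dataclass
class Video:
    id: str
    title: str
    published: datetime
    crawled: datetime
    duration_seconds: int
    tags: List[str]
    lang: str
    langid: int


def parse_video(message: dict) -> Video:
    v = message["video"]
    return Video(
        id=v["id"],
        title=v["title"],
        published=parse_utc(v["published"]),
        crawled=parse_utc(v["crawled"]),
        duration_seconds=int(v["duration"]),  # sent as a string in the sample
        tags=list(v.get("tags") or []),       # tags are not always included
        lang=v["lang"],
        langid=int(v["langid"]),              # sent as a string in the sample
    )


# Abbreviated copy of the sample; the title is a placeholder.
sample = {
    "video": {
        "id": "yt.MK6nBKODkVE",
        "title": "placeholder title",
        "published": "2024-08-16T11:28:45Z",
        "crawled": "2024-08-16T15:47:20Z",
        "duration": "3143",
        "lang": "en",
        "langid": "22",
    }
}
video = parse_video(sample)
lag = video.crawled - video.published
print(video.duration_seconds, lag)  # prints: 3143 4:18:35
```

The difference between crawled and published gives a rough measure of how long after publication a video was harvested.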
Comment
{
"comment": {
"id": "yt.UgxdiWsNRHBmpDlNggl4AaABAg.A79Qt_KSQPfA79h8Tq71DS",
"videoid": "yt.s7BjsDqHsto",
"siteid": "youtube",
"site_commentid": "UgxdiWsNRHBmpDlNggl4AaABAg.A79Qt_KSQPfA79h8Tq71DS",
"title": "How to tell I know nothing about motorcycle without saying I know nothing about motorcycle👆🏼",
"content": "How to tell I know nothing about motorcycle without saying I know nothing about motorcycle👆🏼",
"published": "2024-08-15T12:25:45Z",
"crawled": "2024-08-16T15:47:51Z",
"videourl": "https://www.youtube.com/watch?v=s7BjsDqHsto",
"commentsurl": "https://www.youtube.com/watch?v=s7BjsDqHsto&lc=UgxdiWsNRHBmpDlNggl4AaABAg.A79Qt_KSQPfA79h8Tq71DS",
"author": {
"name": "@surendarmohan9878",
"site_authorid": "UC-4A4p2rBFtcktdQfWLSgGg",
"authorurl": "http://www.youtube.com/@surendarmohan9878",
"profile_picture": "https://yt3.ggpht.com/ytc/AIdro_kx74ZYm1k0KoPfPQzXHltKaMnmZjeOLGI9rsIGnAgMQ0Ro=s48-c-k-c0x00ffffff-no-rj"
},
"lang": "en",
"langid": "22"
}
}
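In the comment sample above, videoid ties the comment to its video, commentsurl carries v and lc query parameters, and site_commentid contains a dot separating two parts. The Python sketch below extracts those pieces. Treating a dotted site_commentid as "parent comment, then reply" is an assumption based on the shape of the sample, not something this document states, and comment_keys is a hypothetical helper name.

```python
from urllib.parse import parse_qs, urlparse


def comment_keys(comment: dict) -> dict:
    """Collect the identifiers that link a comment to its video and thread."""
    query = parse_qs(urlparse(comment["commentsurl"]).query)
    # Assumption: a dotted ID means "<parent comment>.<reply>"; an undotted
    # ID is treated as a top-level comment.
    parent, _, reply = comment["site_commentid"].partition(".")
    return {
        "thread_id": comment["videoid"],
        "site_videoid": query.get("v", [None])[0],
        "linked_commentid": query.get("lc", [None])[0],
        "parent_commentid": parent if reply else None,
        "is_reply": bool(reply),
    }


# Fields copied from the sample comment message.
sample = {
    "id": "yt.UgxdiWsNRHBmpDlNggl4AaABAg.A79Qt_KSQPfA79h8Tq71DS",
    "videoid": "yt.s7BjsDqHsto",
    "site_commentid": "UgxdiWsNRHBmpDlNggl4AaABAg.A79Qt_KSQPfA79h8Tq71DS",
    "commentsurl": "https://www.youtube.com/watch?v=s7BjsDqHsto&lc=UgxdiWsNRHBmpDlNggl4AaABAg.A79Qt_KSQPfA79h8Tq71DS",
}
keys = comment_keys(sample)
print(keys["thread_id"], keys["site_videoid"], keys["is_reply"])
# prints: yt.s7BjsDqHsto s7BjsDqHsto True
```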