下面以抖音为例,原创提供的代码不完整,修复了一下。
API仅供学习交流使用,禁止用于商业用途、违法用途等,否则后果自负!
因为工作需要,研究了一下抖音的视频接口,都需要 cookie
、 msToken
、 a_bogus
....这些参数才能调用获取解析,而且 a_bogus
算法且复杂, cookie
也要时长更新,而我这个能跳过这些参数,直接能解析视频信息出来,无需 cookie
、 a_bogus
,且解析的视频一直都不会失效。
解析前:
解析后:
实现原理
当我们请求手机端的抖音链接的时候,在 F12
获取源代码能得到一些视频信息 json
数据,所以只需要正则取到源代码下面的 json
数据,请求头只需添加 User-Agent
、 Referer
即可,无需 cookie
。
模拟请求,得到了需要的数据,再使用正则取出转成json数据再取出值即可。
注意事项
原贴:抖音视频链接 重定向错误分析
https://www.douyin.com/aweme/v1/play/?video_id=v0200fg10000cqd77sfog65gatpq9nm0
该链接是抖音无水印播放接口,且每次访问都会跳转到新的地址,按道理视频一直都不会失效, video_id
参数在页面能搜索到。
贴上脚本
PHP版:
<?php
$msg = urldecode($_REQUEST['msg']); //获取视频链接
if (is_numeric($msg)) {
$video_id = $msg;
} else {
preg_match('/https?:\/\/[^\s]+/', $msg, $video_url);
$video_url = $video_url[0];
$redirected_url = get_redirected_url($video_url);
preg_match('/(\d+)/', $redirected_url, $matches);
$video_id = $matches[1];
// echo $video_id;
}
function get_redirected_url($url) {
$ch = curl_init($url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);
curl_exec($ch);
$redirected_url = curl_getinfo($ch, CURLINFO_EFFECTIVE_URL);
curl_close($ch);
return $redirected_url;
}
$headers = [
'User-Agent: Mozilla/5.0 (Linux; Android 8.0.0; SM-G955U Build/R16NW) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/116.0.0.0 Mobile Safari/537.36',
'Referer: https://www.douyin.com/?is_from_mobile_home=1&recommend=1'
];
$url = "https://www.iesdouyin.com/share/video/$video_id/";
$ch = curl_init($url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_HTTPHEADER, $headers);
$response = curl_exec($ch);
curl_close($ch);
preg_match('/_ROUTER_DATA\s*=\s*(\{.*?\});/', $response, $matches);
$data = $matches[1];
// 解析 JSON 数据
$jsonData = json_decode($data, true);
// 获取视频信息
$itemList = $jsonData['loaderData']['video_(id)/page']['videoInfoRes']['item_list'][0];
$nickname = $itemList['author']['nickname'];
$title = $itemList['desc'];
$awemeId = $itemList['aweme_id'];
$video = $itemList['video']['play_addr']['uri'];
$videoUrl = $video !== null ? (strpos($video, 'mp3') === false ? 'https://www.douyin.com/aweme/v1/play/?video_id=' . $video : $video) : null;
$cover = $itemList['video']['cover']['url_list'][0];
$images = $itemList['images'] ?? null;
$output = [
'msg' =>empty($nickname)?'解析失败!':'解析成功!',
'name' => $nickname,
'title' => $title,
// 'aweme_id' => $awemeId,
'video' => $videoUrl,
'cover' => $cover,
'images' => array_map(function($image) {
return $image['url_list'][0];
}, is_array($images) ? $images : []),
'type' =>empty($images)?'视频':'图集',
'tips' => '解析失败多试几次,Meiyan美言AI'
];
header('Content-Type: application/json');
echo json_encode($output,JSON_UNESCAPED_UNICODE | JSON_PRETTY_PRINT | JSON_UNESCAPED_SLASHES);
?>
Python版:
#Meiyan美言AI
import requests
import re
import json
headers ={
'user-agent':'Mozilla/5.0 (Linux; Android 8.0.0; SM-G955U Build/R16NW) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/116.0.0.0 Mobile Safari/537.36',
'referer':'https://www.douyin.com/?is_from_mobile_home=1&recommend=1'
}
url ='https://www.iesdouyin.com/share/video/7358413207083486505/'
res =requests.get(url,headers=headers).text
data =re.findall(r'_ROUTER_DATA\s*=\s*(\{.*?\});', res)[0]
json_data =json.loads(data)
# print(data)
item_list =json_data['loaderData']['video_(id)/page']['videoInfoRes']['item_list'][0]
nickname =item_list['author']['nickname']
title =item_list['desc']
aweme_id =item_list['aweme_id']
video =item_list['video']['play_addr']['uri']
video = 'https://www.douyin.com/aweme/v1/play/?video_id=' + video if 'mp3' not in video else video
cover =item_list['video']['cover']['url_list'][0]
img =item_list['images']
print(title)
print(aweme_id)
print(video)
print(cover)
for image in img:
images = image['url_list'][0]
print(images)
#Meiyan美言AI
修复版本
上面提供的代码和作者提供的演示不一样,于是增加了一些。
- 在
video
下增加play_video
字段:通过get_final_video_url()
函数获取视频的真实播放地址,如果获取失败则返回null
- 在
title
下新增aweme_id
字段:从原始URL或重定向后的URL中提取数字ID(如
https://www.douyin.com/video/XXX
提取XXX
) - 添加
header('Content-Type: application/json; charset=utf-8')
确保UTF-8
编码,所有字符串处理都确保UTF-8
兼容 - 增加更多空值检查(?? 运算符)更完善的正则表达式匹配
5.extract_aweme_id()
- 专门用于从URL提取数字IDget_final_video_url()
- 专门用于获取最终播放地址
修改后的php版代码:
<?php
header('Content-Type: application/json; charset=utf-8');
$msg = urldecode($_REQUEST['msg']); // 获取视频链接
// 获取aweme_id(从URL提取最终的数字ID)
function extract_aweme_id($url) {
$redirected = get_redirected_url($url);
preg_match('/\/(\d+)(?:\/|\?|$)/', $redirected, $matches);
return $matches[1] ?? null;
}
if (is_numeric($msg)) {
$video_id = $msg;
$aweme_id = $msg; // 如果直接传入数字,则认为是aweme_id
} else {
preg_match('/https?:\/\/[^\s]+/', $msg, $video_url);
$video_url = $video_url[0];
// 获取aweme_id(从原始URL或重定向后的URL)
$aweme_id = extract_aweme_id($video_url);
$redirected_url = get_redirected_url($video_url);
preg_match('/(\d+)/', $redirected_url, $matches);
$video_id = $matches[1];
}
function get_redirected_url($url) {
$ch = curl_init($url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);
curl_exec($ch);
$redirected_url = curl_getinfo($ch, CURLINFO_EFFECTIVE_URL);
curl_close($ch);
return $redirected_url;
}
function get_final_video_url($video_url) {
if (empty($video_url)) return null;
$ch = curl_init($video_url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, false); // 不自动跳转
curl_setopt($ch, CURLOPT_HEADER, true); // 获取头信息
curl_setopt($ch, CURLOPT_NOBODY, true); // 只获取头信息
curl_exec($ch);
$redirect_url = curl_getinfo($ch, CURLINFO_REDIRECT_URL);
curl_close($ch);
return $redirect_url ?: $video_url;
}
$headers = [
'User-Agent: Mozilla/5.0 (Linux; Android 8.0.0; SM-G955U Build/R16NW) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/116.0.0.0 Mobile Safari/537.36',
'Referer: https://www.douyin.com/?is_from_mobile_home=1&recommend=1'
];
$url = "https://www.iesdouyin.com/share/video/$video_id/";
$ch = curl_init($url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_HTTPHEADER, $headers);
$response = curl_exec($ch);
curl_close($ch);
preg_match('/_ROUTER_DATA\s*=\s*(\{.*?\});/', $response, $matches);
$data = $matches[1] ?? '{}';
// 解析 JSON 数据
$jsonData = json_decode($data, true);
// 获取视频信息
$itemList = $jsonData['loaderData']['video_(id)/page']['videoInfoRes']['item_list'][0] ?? [];
$nickname = $itemList['author']['nickname'] ?? '';
$title = $itemList['desc'] ?? '';
$video = $itemList['video']['play_addr']['uri'] ?? '';
// 修改点:在视频地址后添加 req_cdn_type 参数
$videoUrl = $video ? (strpos($video, 'mp3') === false
? 'https://www.douyin.com/aweme/v1/play/?video_id=' . $video . '&req_cdn_type='
: $video)
: null;
$cover = $itemList['video']['cover']['url_list'][0] ?? '';
$images = $itemList['images'] ?? null;
// 获取真实播放地址
$play_video = null;
if ($videoUrl) {
$play_video = get_final_video_url($videoUrl);
}
$output = [
'msg' => empty($nickname) ? '解析失败!' : '解析成功!',
'name' => $nickname,
'title' => $title,
'id' => $aweme_id, // 新增的aweme_id字段
'video' => $videoUrl,
'play_video' => $play_video, // 新增的真实播放地址
'cover' => $cover,
'images' => array_map(function($image) {
return $image['url_list'][0] ?? '';
}, is_array($images) ? $images : []),
'type' => empty($images) ? '视频' : '图集',
'tips' => '解析失败多试几次,Meiyan美言AI'
];
echo json_encode($output, JSON_UNESCAPED_UNICODE | JSON_PRETTY_PRINT | JSON_UNESCAPED_SLASHES);
?>
在获取到的 play_video
地址后面添加 &req_cdn_type=
参数版:
<?php
header('Content-Type: application/json; charset=utf-8');
$msg = urldecode($_REQUEST['msg']); // 获取视频链接
// 获取aweme_id(从URL提取最终的数字ID)
function extract_aweme_id($url) {
$redirected = get_redirected_url($url);
preg_match('/\/(\d+)(?:\/|\?|$)/', $redirected, $matches);
return $matches[1] ?? null;
}
if (is_numeric($msg)) {
$video_id = $msg;
$aweme_id = $msg; // 如果直接传入数字,则认为是aweme_id
} else {
preg_match('/https?:\/\/[^\s]+/', $msg, $video_url);
$video_url = $video_url[0];
// 获取aweme_id(从原始URL或重定向后的URL)
$aweme_id = extract_aweme_id($video_url);
$redirected_url = get_redirected_url($video_url);
preg_match('/(\d+)/', $redirected_url, $matches);
$video_id = $matches[1];
}
function get_redirected_url($url) {
$ch = curl_init($url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);
curl_exec($ch);
$redirected_url = curl_getinfo($ch, CURLINFO_EFFECTIVE_URL);
curl_close($ch);
return $redirected_url;
}
function get_final_video_url($video_url) {
if (empty($video_url)) return null;
// 初始化cURL
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $video_url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, false); // 不自动跳转
curl_setopt($ch, CURLOPT_HEADER, true); // 获取头信息
curl_setopt($ch, CURLOPT_NOBODY, true); // 只获取头信息
curl_exec($ch);
// 获取重定向URL
$redirect_url = curl_getinfo($ch, CURLINFO_REDIRECT_URL);
curl_close($ch);
// 如果没有重定向URL,返回原始URL
$final_url = $redirect_url ?: $video_url;
// 添加req_cdn_type参数 req_cdn_type=1 可以是空也可以是1/2/3
if (strpos($final_url, '?') === false) {
$final_url .= '?req_cdn_type=';
} else {
$final_url .= '&req_cdn_type=';
}
return $final_url;
}
$headers = [
'User-Agent: Mozilla/5.0 (Linux; Android 8.0.0; SM-G955U Build/R16NW) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/116.0.0.0 Mobile Safari/537.36',
'Referer: https://www.douyin.com/?is_from_mobile_home=1&recommend=1'
];
$url = "https://www.iesdouyin.com/share/video/$video_id/";
$ch = curl_init($url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_HTTPHEADER, $headers);
$response = curl_exec($ch);
curl_close($ch);
preg_match('/_ROUTER_DATA\s*=\s*(\{.*?\});/', $response, $matches);
$data = $matches[1] ?? '{}';
// 解析 JSON 数据
$jsonData = json_decode($data, true);
// 获取视频信息
$itemList = $jsonData['loaderData']['video_(id)/page']['videoInfoRes']['item_list'][0] ?? [];
$nickname = $itemList['author']['nickname'] ?? '';
$title = $itemList['desc'] ?? '';
$video = $itemList['video']['play_addr']['uri'] ?? '';
$videoUrl = $video ? (strpos($video, 'mp3') === false ? 'https://www.douyin.com/aweme/v1/play/?video_id=' . $video : $video) : null;
$cover = $itemList['video']['cover']['url_list'][0] ?? '';
$images = $itemList['images'] ?? null;
// 获取真实播放地址(已添加req_cdn_type参数)
$play_video = null;
if ($videoUrl) {
$play_video = get_final_video_url($videoUrl);
}
$output = [
'msg' => empty($nickname) ? '解析失败!' : '解析成功!',
'name' => $nickname,
'title' => $title,
'id' => $aweme_id, // 新增的aweme_id字段
'video' => $videoUrl,
'play_video' => $play_video, // 新增的真实播放地址(已添加req_cdn_type参数)
'cover' => $cover,
'images' => array_map(function($image) {
return $image['url_list'][0] ?? '';
}, is_array($images) ? $images : []),
'type' => empty($images) ? '视频' : '图集',
'tips' => '解析失败多试几次,Meiyan美言AI'
];
echo json_encode($output, JSON_UNESCAPED_UNICODE | JSON_PRETTY_PRINT | JSON_UNESCAPED_SLASHES);
?>
使用方法
现在只支持解析 图集
or 视频
,后续可能会添加 Live实况
解析。
请求示例: 域名?msg=视频链接
(msg可在代码$_REQUEST中换成其他)
请求方式:GET/POST
返回格式:JSON
例如:
视频链接:https://v.douyin.com/irC9f9js
请求地址:http://xxx.com/1.php?msg=https://v.douyin.com/irC9f9js
返回结果:
解析问题
- 有时候会解析失败,是因为代码没有算法Cookie
- URL重定向失效,平台经常更新域名规则,旧的重定向解析方式可能失效
- 平台前端经常改版,_ROUTER_DATA可能不存在或结构变化
- 反爬虫机制,平台对自动化请求有严格限制
- 网络环境问题,服务器IP被限制
- 视频权限限制,私密视频/作者设置限制
最右解析方法,几乎类似。
直接 F12
看分析返回内容,然后自己键值对取值即可
PHP代码:
<?php
$msg =$_GET['msg'];
$parsed_url = parse_url($msg);
parse_str($parsed_url['query'], $query_params);
$pid = (int)($query_params['pid'] ?? null);
$url = 'https://share.xiaochuankeji.cn/planck/share/post/detail_h5';
$data = [
'pid' => $pid,
'h_av' => '5.2.13.011'
];
$ch = curl_init($url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_POST, true);
curl_setopt($ch, CURLOPT_POSTFIELDS, json_encode($data));
curl_setopt($ch, CURLOPT_HTTPHEADER, [
'Content-Type: application/json'
]);
$response = curl_exec($ch);
$responseData = json_decode($response, true);
// 提取数据
$title = $responseData['data']['post']['content'];
$name = $responseData['data']['post']['member']['name'];
$imgs = $responseData['data']['post']['imgs'];
$id = $responseData['data']['post']['imgs'][0]['id'];
$videos = isset($responseData['data']['post']['videos'][strval($id)]) ? $responseData['data']['post']['videos'][strval($id)]['url'] : '';
foreach ($imgs as $img) {
$img['urls']['540_webp']['urls'][0] . "\n";
}
header('Content-Type: application/json');
echo json_encode([
'code' =>empty($id)?201:200,
'name' => $name,
'title' => $title,
'video' => $videos,
'imgs' => array_map(function ($img) {
return $img['urls']['540_webp']['urls'][0];
}, $imgs),
'tips' => '解析失败多试几次,Meiyan美言AI'
],JSON_UNESCAPED_UNICODE | JSON_PRETTY_PRINT | JSON_UNESCAPED_SLASHES);
?>
Python代码:
import re
import requests
msg ='https://share.xiaochuankeji.cn/hybrid/share/post?pid=402085918'
# msg输入你要解析的最右app分享地址
pid =re.findall('pid=(\d+)',msg)[0]
json = {"pid": int(pid), "h_av": "5.2.13.011"}
url = 'https://share.xiaochuankeji.cn/planck/share/post/detail_h5'
req = requests.post(url, json=json).json()
# 提取数据
title = req['data']['post']['content']
name = req['data']['post']['member']['name']
imgs = req['data']['post']['imgs']
id = req['data']['post']['imgs'][0]['id']
videos = req['data']['post']['videos'].get(str(id))['url']
print(name,title)
print('视频:',videos)
print('图集:')
for i in imgs:
img = i['urls']['540_webp']['urls'][0]
print(img)