Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

爬取友联头像与作者错误 #136

Open
HeLongaa opened this issue Mar 19, 2024 · 0 comments
Open

爬取友联头像与作者错误 #136

HeLongaa opened this issue Mar 19, 2024 · 0 comments

Comments

@HeLongaa
Copy link

主题 halo -theme-hao
友联页面格式(部分):

<h2>
                    <a class="headerlink" href="#友情链接-24"
                       title="友情链接(24)"></a>
                    友情链接 (24)
                </h2>

                <div class="flink-desc">每个站点都值得一看</div>

                <!-- 第一个,使用卡片展示 -->
                

                <div class="flink-list">
                    <div class="flink-list-item">
                        <span style="background-color:#425AEF"
                              class="site-card-tag">荐</span>
                        <a class="cf-friends-link" rel="external nofollow" target="_blank" href="https://dusays.com"
                           title="杜老师说">
                            <img class="flink-avatar cf-friends-avatar" alt="杜老师说"

                                 src="/upload/lyszm17.gif"
                                 data-lazy-src="https://resources.blog.duolaa.asia/img/202402150119899.webp">
                            <div class="flink-item-info no-lightbox">
                                <span class="flink-item-name cf-friends-name">杜老师说</span>
                                <span class="flink-item-desc" title="师者,传道,授业,解惑!">师者,传道,授业,解惑!</span>
                                <img
                                     src="/upload/lyszm17.gif"
                                     data-lazy-src="https://resources.blog.duolaa.asia/img/202402150119899.webp">
                            </div>
                        </a>
                    </div>
                    <div class="flink-list-item">
                        <span style="background-color:#425AEF"
                              class="site-card-tag">荐</span>
                        <a class="cf-friends-link" rel="external nofollow" target="_blank" href="https://blog.zhheo.com/"
                           title="张洪Heo">
                            <img class="flink-avatar cf-friends-avatar" alt="张洪Heo"

                                 src="/upload/lyszm17.gif"
                                 data-lazy-src="https://bu.dusays.com/2022/12/28/63ac2812183aa.png">
                            <div class="flink-item-info no-lightbox">
                                <span class="flink-item-name cf-friends-name">张洪Heo</span>
                                <span class="flink-item-desc" title="分享设计与科技生活">分享设计与科技生活</span>
                                <img
                                     src="/upload/lyszm17.gif"
                                     data-lazy-src="https://bu.dusays.com/2022/12/28/63ac2812183aa.png">
                            </div>
                        </a>
                    </div>

使用butterfly模式爬取,Actions提示:链接,头像,名称长度不一致;
查看数据库发现:
image
相邻两个会被识别为同一个头像,还是错误的
建议适配该主题或提供解决方案

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant