tvsou: unable to obtain data, please help write an ini

kongjun95848
Offline
Donator
Joined: 2 months
Last seen: 1 month

m.tvsou.com/epg/94263ee0/

<span>07:00</span><a href='//m.51livetv.com/wiki/lm_440864/' target='_blank'>栏目</a><script type='text/javascript'>judgeTime('1650927600000','//www.51livetv.com/channel/1342/','1650928800000','//m.51livetv.com/wiki/l...);</script></li>
<li><span>07:20</span><a href='//m.51livetv.com/wiki/zzdms/' target='_blank'>郑州大民生</a><script type='text/javascript'>judgeTime('1650928800000','//www.51livetv.com/channel/1342/','1650931200000','//m.51livetv.com/wiki/z...);</script></li>
<li><span>08:00</span>郑州新闻联播/直通政务<script type='text/javascript'>judgeTime('1650931200000','//www.51livetv.com/channel/1342/','1650932700000','');</script></li>
<li><span>08:25</span>县区政务<script type='text/javascript'>judgeTime('1650932700000','//www.51livetv.com/channel/1342/','1650935400000','');</script></li>
<li><span>09:10</span><a href='//m.51livetv.com/wiki/jzyzm/' target='_blank'>电视剧:决战燕子门31</a><a href='//m.51livetv.com/fenji/jzyzm_31.htm' target='_blank' target="_blank" style="font-size: 13px;color: #d90024;margin-left: 10px;">第31集剧情</a><a href='//m.51livetv.com/yyb/jzyzm/' target='_blank' target="_blank" style="font-size: 13px;color: #d90024;margin-left: 10px;">演员表</a><script type='text/javascript'>judgeTime('1650935400000','//www.51livetv.com/channel/1342/','1650937800000','//m.51livetv.com/wiki/j...);</script></li>
<li><span>09:50</span><a href='//m.51livetv.com/wiki/jzyzm/' target='_blank'>电视剧:决战燕子门32</a><a href='//m.51livetv.com/fenji/jzyzm_32.htm' target='_blank' target="_blank" style="font-size: 13px;color: #d90024;margin-left: 10px;">第32集剧情</a><a href='//m.51livetv.com/yyb/jzyzm/' target='_blank' target="_blank" style="font-size: 13px;color: #d90024;margin-left: 10px;">演员表</a><script

Blackbear199
Offline
WG++ Team member, Donator
Joined: 6 years
Last seen: 6 hours

look at your url_index.
you have an extra | at the end.
in this case it doesn't hurt anything, but it shouldn't be there.

check your webgrab log, it will show you that all shows were skipped because of a missing title.
your title scrub is bad.
sites like this can be confusing to new learners because in this case all the lines start with <a href,
but on this site the title is always the first one.
so just scrub it with single as you did; there is no need to be fancy with the scrub, as single will always keep the first result.

index_title.scrub {single|<a href=|>|</a>|</a>}
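To illustrate why keeping only the first result is enough here, below is a rough Python sketch of first-match extraction. It is not WebGrab+Plus code and only mimics the idea; the helper name first_between and the shortened sample line are made up for the example.

```python
from typing import Optional


def first_between(html: str, block_start: str, elem_start: str, elem_end: str) -> Optional[str]:
    """Return the text of the first matching element, or None if nothing matches."""
    block = html.find(block_start)
    if block == -1:
        return None
    start = html.find(elem_start, block + len(block_start))
    if start == -1:
        return None
    start += len(elem_start)
    end = html.find(elem_end, start)
    if end == -1:
        return None
    return html[start:end]


# Shortened version of one show from the page: three links, the title is the first.
show = (
    "<li><span>09:10</span>"
    "<a href='//m.51livetv.com/wiki/jzyzm/' target='_blank'>电视剧:决战燕子门31</a>"
    "<a href='//m.51livetv.com/fenji/jzyzm_31.htm' target='_blank'>第31集剧情</a>"
    "<a href='//m.51livetv.com/yyb/jzyzm/' target='_blank'>演员表</a></li>"
)

title = first_between(show, "<a href=", ">", "</a>")
print(title)
```

The later links (episode plot, cast list) are never looked at, because the search stops at the first closing </a>.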

kongjun95848

index_title.scrub {single|<a href=|>|</a>|</a>}

It works after replacing the line, thank you!

Blackbear199

i just noticed that in your channel creation section you have:

index_site_id.modify {cleanup(removeduplicates=equal,100)}

it should be like this

index_site_id.modify {cleanup(removeduplicates link="index_site_channel")}

with what you have it will only remove duplicates in the site_id value and not the corresponding duplicate channel name.

=equal,100 is the default action so you don't need to specify it. it doesn't hurt anything if you do though.
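The difference between the two forms can be pictured with two parallel lists. This is only a Python sketch of the idea, not how WebGrab+Plus implements it; the function name and the sample ids and names are hypothetical.

```python
from typing import List, Tuple


def remove_duplicates_linked(site_ids: List[str], names: List[str]) -> Tuple[List[str], List[str]]:
    """Drop repeated site ids and drop the channel name at the same position."""
    seen = set()
    kept_ids: List[str] = []
    kept_names: List[str] = []
    for site_id, name in zip(site_ids, names):
        if site_id in seen:
            continue
        seen.add(site_id)
        kept_ids.append(site_id)
        kept_names.append(name)
    return kept_ids, kept_names


# Hypothetical channel list in which the first channel appears twice.
site_ids = ["id_a", "id_b", "id_a", "id_c"]
names = ["Channel A", "Channel B", "Channel A", "Channel C"]

# Without the link only the ids are cleaned, so the two lists no longer line up.
ids_only = list(dict.fromkeys(site_ids))
print(len(ids_only), len(names))

# With the link both lists shrink together and stay aligned.
linked_ids, linked_names = remove_duplicates_linked(site_ids, names)
print(list(zip(linked_ids, linked_names)))
```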

kongjun95848

Some titles use different markup, so some programs are missing. How can I write the scrub so that all of them are captured?

  • 07:20 郑州大民生 judgeTime('1650928800000','//www.51livetv.com/channel/1342/','1650931200000','//m.51livetv.com/wiki/zzdms/');
  • 08:00 郑州新闻联播/直通政务 judgeTime('1650931200000','//www.51livetv.com/channel/1342/','1650932700000','');
  • 08:25 县区政务 judgeTime('1650932700000','//www.51livetv.com/channel/1342/','1650935400000','');
  • 09:10 电视剧:决战燕子门31

    [ Info ] Group (0) :
    [ Info ] update requested for - 1 - out of - 1 - channels for 1 day(s)
    [ Debug ]
    [ Info ] ( 1/1 ) MM.TVSOU.COM -- chan. (xmltv_id=郑州时政) -- mode Force
    [ Debug ] skipped show without a title at 26/04/2022 08:00:00
    [ Debug ] skipped show without a title at 26/04/2022 08:25:00
    [ Debug ] skipped show without a title at 26/04/2022 12:22:00
    [ Debug ] skipped show without a title at 26/04/2022 19:33:00
    [ Debug ] skipped show without a title at 26/04/2022 19:55:00
    [ Debug ] skipped show without a title at 26/04/2022 22:00:00
    [ Debug ] skipped show without a title at 26/04/2022 22:25:00
    [ Debug ] skipped : last show, no next startime to use as stop
    [ Info ]
    [ Debug ]
    [ Debug ] 26 shows in 1 channels
    [ Debug ] 0 updated shows
    [ Debug ] 26 new shows added
    [ Info ]
    [ Info ]
    [ ] Job finished at 26/04/2022 11:20:52 done in 1s

    家国记忆

    栏目

    郑州大民生

    电视剧:决战燕子门31

    Blackbear199

    the mobile site is a mess.
    titles are in several different tags.
    have you checked the non-mobile site?
    i had a quick look and it seems to use the same tags everywhere.
    i think you should try that.

    kongjun95848

    The non-mobile site also uses different tags, and the reaction is strong. How should I write it?

    https://m.tvsou.com/epg/94263ee0/w2
    index_title.scrub {single| |</a>|</a>}

    Blackbear199

    [ Info ] found: /root/.wg++/siteini.pack/China/tvsou.com.ini -- Revision 03

    you're not using the correct ini, revision 3 is the old one.
    the new one is revision 4.
    after you download the files, you have to rename them and remove the underscores; webgrab adds these to all uploads for security reasons.

    works fine for me..
    found: /raiddata/0/NAS_WebGrab/siteini.user/China/tvsou.com.ini -- Revision 04 <====== new file revision number

    update requested for - 1 - out of - 1 - channels for 1 day(s)
    ( 1/1 ) TVSOU.COM -- chan. (xmltv_id=河南: 郑州电视台) -- mode Force
    innnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
    1.67 sec/update

    kongjun95848

    OK! Thanks again, it works now.
    It was my carelessness.

    kongjun95848

    What settings need to be added to have the output generated in GZ (gzip) format?

    Blackbear199

    webgrab cannot do this.
    you have to compress the file after webgrab has completed.
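One way to do the compression afterwards is a small script run once the grab has finished. Below is a minimal Python sketch using only the standard library; the file name guide.xml is an assumption, so use whatever output file your own config writes.

```python
import gzip
import shutil
import tempfile
from pathlib import Path


def gzip_file(source: Path) -> Path:
    """Write a gzip copy next to `source` (same name plus '.gz') and return its path."""
    target = source.with_name(source.name + ".gz")
    with source.open("rb") as src, gzip.open(target, "wb") as dst:
        shutil.copyfileobj(src, dst)
    return target


if __name__ == "__main__":
    # Demo on a throwaway file. In practice, pass the path of the xmltv file
    # that webgrab wrote (the name guide.xml is only an assumption).
    with tempfile.TemporaryDirectory() as tmp:
        guide = Path(tmp) / "guide.xml"
        guide.write_text("<tv></tv>", encoding="utf-8")
        packed = gzip_file(guide)
        print(packed.name)
```

The original file is left in place, so the uncompressed guide stays available to anything that still reads it.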

    kongjun95848

    For TV dramas (电视剧) the program name that follows is not shown, so the titles are incomplete. The program description is also no longer shown. And how do I change things to grab 2 days of data?
    https://www.tvsou.com/epg/94263ee0/
    <programme start="20220427003000 +0800" stop="20220427013000 +0800" channel="hnzzsz">
    <title lang="zh">电视剧</title>
    </programme>
    <programme start="20220427013000 +0800" stop="20220427030000 +0800" channel="hnzzsz">
    <title lang="zh">电视剧</title>
    </programme>
    <programme start="20220427030000 +0800" stop="20220427040000 +0800" channel="hnzzsz">
    <title lang="zh">电视剧</title>
    </programme>
    <programme start="20220427040000 +0800" stop="20220427050000 +0800" channel="hnzzsz">
    <title lang="zh">电视剧</title>
    </programme>
    <programme start="20220427050000 +0800" stop="20220427055000 +0800" channel="hnzzsz">
    <title lang="zh">电视剧</title>
    </programme>
    <programme start="20220427055000 +0800" stop="20220427070000 +0800" channel="hnzzsz">
    <title lang="zh">家国记忆</title>
    </programme>

    Blackbear199

    the main site (non-mobile) was having issues this morning, the details page was not getting downloaded.

    i made some tweaks to the ini and also did the mobile site, which seemed to work better.
    both are available on the epg channels page under china, or do a siteini.pack update.

    kongjun95848

    On the mobile site, some page code appears in the program titles. Can it be filtered out, and what should I add to do that? Please refer to the data I downloaded.

    On the non-mobile site there is no data for some channels, and the detailed information is still not available.

    <title lang="zh">聚焦双改<script type='text/javascript'>judgeTime('1651209000000','//www.51livetv.com/channel/1343/','1651209300000','');</script>

    Blackbear199

    does this look ok?
    same 3 channels,regular site and mobile.
    looks the same to me.

    kongjun95848

    The EPG for some channels contains web page source code. Try 94263ee0: some of its programs come with source code in the title.

    Blackbear199

    think i got that fixed also..

    kongjun95848

    I changed
    </span>| |</td>|</td>
    to
    </span>| |<script type=|>|<script type=|>
    and the garbled code now looks normal. I have uploaded an attachment, can you check whether it is right?

    kongjun95848

    Next, I am preparing to donate for a membership.

    Blackbear199

    that won't work because that scrub would fail for channels with data like this:

    <li>
     <span>18:30</span>
      省新闻<td></td>
    </li>

    and that's what it's used for.
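The need for an end marker that covers every variant can be pictured in Python. This is only an illustration and not what the siteini does internally: the function name title_from_item and the list of end markers are assumptions, and the sample items are shortened from the ones posted in this thread.

```python
import re

# Anything that can follow the title in the variants seen in this thread.
END_MARKERS = ("</a>", "<script", "<td", "</li>")
TAG = re.compile(r"<[^>]+>")


def title_from_item(item: str) -> str:
    """Take the text after </span>, cut at the earliest end marker, drop leftover tags."""
    rest = item.split("</span>", 1)[1]
    positions = [pos for pos in (rest.find(marker) for marker in END_MARKERS) if pos != -1]
    cut = min(positions, default=len(rest))
    return TAG.sub("", rest[:cut]).strip()


linked = ("<li><span>09:10</span><a href='//m.51livetv.com/wiki/jzyzm/' target='_blank'>"
          "电视剧:决战燕子门31</a><a href='//m.51livetv.com/yyb/jzyzm/'>演员表</a></li>")
with_script = ("<li><span>08:00</span>郑州新闻联播/直通政务"
               "<script type='text/javascript'>judgeTime();</script></li>")
with_td = "<li>\n <span>18:30</span>\n  省新闻<td></td>\n</li>"

for item in (linked, with_script, with_td):
    print(title_from_item(item))
```

If "<td" were simply swapped for "<script" in the marker list, the third item would only be cut at </li> and only the tag stripping would hide the leftover <td></td>; a plain scrub has no such safety net, which is the failure described above.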

    Blackbear199

    try these

    kongjun95848

    I compared the two versions, one with TD changed to script type= and one unmodified. TD is not found on the mobile site.

    kongjun95848

    The attachment you just uploaded works completely normally and passed my test. Thanks again!

    Blackbear199

    i made some small changes.
    i decided not to separate the episode number from the subtitle, wg sometimes messes this up.
    added channel logos for the non-mobile site, the mobile site does not have them.

    files updated above, no revision number change.

    kongjun95848

    https://epg.sports8.net/

    Could you also write an ini for this site, as a backup?

    Blackbear199

    already had it done.

    kongjun95848

    Thank you!

    kongjun95848
    Blackbear199

    where is the first link with the json data from?
    it must be from an app?
    do you have the rest of the links for this also, like channel, city/region and details page links?

    the second url cannot be used because it only shows part of the day's schedule; the rest of the day is generated in javascript code and it's an encrypted string that's base64 encoded.
    even if it could be figured out, webgrab cannot grab 2 urls to get the full day schedule.

    kongjun95848

    The first link is the interface address of an IPTV service, and it carries the data shown on the second one. In "epgCode=channel name", the channel name is the name used on the second website. It gives the live program EPG for the current day.

    Blackbear199

    this uses tvmao.com.
    it's very slow because to get the full day schedule the epg grid page needs to be used, and it's in 2 hour sections, so 12 pages need to be grabbed to get 1 day of epg.

    it's title only also.
    you could create an ini to use the lighttv.tvmao.com url you posted above, as the site_id="xxx" does have the correct channel ids it uses; you just need to change the channel creation section to keep only that, or substring the value you need in scope=urlindex.
    hint: global temp_2 already does this, but it's used to separate the correct channel in the showsplit and not for the url_index.

    you seem to be fairly knowledgeable about how inis work, so it shouldn't be hard to figure this out.

    kongjun95848

    I am a user in China. When I try to donate I get this message: "Donations to this recipient are not supported in this country or region."

    Brought to you by Jan van Straaten

    Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
    Supported by: servercare.nl