Realtor.com - element changes #1
Comments
Hi, I did not expect anyone to be interested in this ancient repo, so what a surprise. I haven't used Python for years, so I don't remember the implementation. Would a JavaScript solution help with your problem?
I'm mainly looking for a solution in Python, but I should be able to translate from JavaScript back to Python.
What exactly do you want as output?
I just want to parse out the URLs held in the elements' data-url attributes.
yes. the data-url values.
On Wed, May 15, 2019 at 10:46 AM pangrr wrote:
Do you want the output to be those elements? The data-url values?
i solved my problem. i guess taking a break from it helps. i had to reference as unicode to obtain data.
Nice to hear that. Let me know if you need anything.
On Wed, May 15, 2019 at 1:40 PM crushingbear wrote:
i solved my problem. i guess taking a break from it helps. i had to reference as unicode to obtain data.

    # Imports assumed; the original snippet did not show them.
    import requests
    from bs4 import BeautifulSoup as bs

    # requestUrl, page and userAgent are defined elsewhere in the script (not shown).
    soup = bs(requests.get(requestUrl + "/pg-" + str(page), headers=userAgent).text, "html.parser")
    #print(soup)
    # Each listing sits in a <div class="data-wrap"> element.
    pagedList = soup.findAll("div", {"class": "data-wrap"})
    numberrec = len(pagedList)
    #print(numberrec)
    for propertyTag in pagedList:
        try:
            # A Tag supports dict-style access to its attributes.
            dataurl = propertyTag[u'data-url']
            print(dataurl)
            #print(propertyTag.attrs)
        except KeyError:
            # This div has no data-url attribute.
            print("error")
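
For anyone who wants to try the same idea without a network request, here is a minimal sketch that pulls data-url values out of a small HTML string. It swaps BeautifulSoup for the standard-library `html.parser` module so it runs with no extra packages. The sample markup, the `/listing/...` paths, and the names `DataUrlCollector` and `extract_data_urls` are made up for illustration; only the `data-wrap` class and the `data-url` attribute come from the thread.

```python
from html.parser import HTMLParser


class DataUrlCollector(HTMLParser):
    """Collect data-url values from <div> tags whose class list includes data-wrap."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "div":
            return
        # attrs is a list of (name, value) pairs; value is None for bare attributes.
        attr_map = dict(attrs)
        classes = (attr_map.get("class") or "").split()
        url = attr_map.get("data-url")
        if "data-wrap" in classes and url:
            self.urls.append(url)


def extract_data_urls(html):
    """Return the data-url values found in html, in document order."""
    parser = DataUrlCollector()
    parser.feed(html)
    parser.close()
    return parser.urls


# Hypothetical markup, not copied from any real page.
SAMPLE_HTML = """
<ul>
  <li><div class="data-wrap" data-url="/listing/example-1">A</div></li>
  <li><div class="data-wrap other" data-url="/listing/example-2">B</div></li>
  <li><div class="data-wrap">no data-url here</div></li>
  <li><div class="unrelated" data-url="/ignored">C</div></li>
</ul>
"""

if __name__ == "__main__":
    for url in extract_data_urls(SAMPLE_HTML):
        print(url)
```

With BeautifulSoup, as used in the snippet above, `propertyTag.get("data-url")` returns `None` when the attribute is missing instead of raising `KeyError`, which can replace the try/except if that reads better.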
Hi there,
Can you give me some input here? I am trying to parse the data-url elements within the pages.