[Python] python在线抓取百度词典的翻译结果翻译单词 →→→→→进入此内容的聊天室

来自 , 2019-09-03, 写在 Python, 查看 105 次.
URL http://www.code666.cn/view/a1b07b89
  1. import urllib
  2. import codecs
  3. from BeautifulSoup import BeautifulSoup
  4. from sys import argv
  5. import re,time
  6.  
  7. class Translate:
  8.     def Start(self):
  9.         self._get_html_sourse()
  10.         self._get_content("enc")
  11.         self._remove_tag()
  12.         self.print_result()
  13.  
  14.     def _get_html_sourse(self):
  15.         word=argv[1] if len(argv)>1 else ''
  16.         url="http://dict.baidu.com/s?wd=%s&tn=dict" %  word
  17.         self.htmlsourse=unicode(urllib.urlopen(url).read(),"gb2312","ignore").encode("utf-8","ignore")
  18.  
  19.     def _get_content(self,div_id):
  20.         soup=BeautifulSoup("".join(self.htmlsourse))
  21.         self.data=str(soup.find("div",{"id":div_id}))
  22.  
  23.     def _remove_tag(self):
  24.         soup=BeautifulSoup(self.data)
  25.         self.outtext=''.join([element  for element in soup.recursiveChildGenerator() if isinstance(element,unicode)])
  26.  
  27.     def print_result(self):
  28.         for item in range(1,10):
  29.             self.outtext=self.outtext.replace(str(item),"\n%s" % str(item))
  30.         self.outtext=self.outtext.replace("  ","\n")
  31.         print self.outtext
  32.  
  33. #from sharejs.com
  34. if __name__=="__main__":
  35.      Translate().Start()
  36. #//python/8530

回复 "python在线抓取百度词典的翻译结果翻译单词"

这儿你可以回复上面这条便签

captcha