浏览代码

[fix] indexing ++ url extraction

Adam Tauber 11 年前
父节点
当前提交
01c2eeb8ff
共有 1 个文件被更改,包括 5 次插入2 次删除
  1. 5
    2
      searx/engines/yahoo.py

+ 5
- 2
searx/engines/yahoo.py 查看文件

47
     dom = html.fromstring(resp.text)
47
     dom = html.fromstring(resp.text)
48
 
48
 
49
     for result in dom.xpath(results_xpath):
49
     for result in dom.xpath(results_xpath):
50
-        url = parse_url(extract_url(result.xpath(url_xpath), search_url))
51
-        title = extract_text(result.xpath(title_xpath)[0])
50
+        try:
51
+            url = parse_url(extract_url(result.xpath(url_xpath), search_url))
52
+            title = extract_text(result.xpath(title_xpath)[0])
53
+        except:
54
+            continue
52
         content = extract_text(result.xpath(content_xpath)[0])
55
         content = extract_text(result.xpath(content_xpath)[0])
53
         results.append({'url': url, 'title': title, 'content': content})
56
         results.append({'url': url, 'title': title, 'content': content})
54
 
57