Browse Source

Merge pull request #1021 from misnyo/google_news

[fix] google news dom xpath fix
Adam Tauber 7 years ago
parent
commit
806cb08750
2 changed files with 58 additions and 8 deletions
  1. 2
    2
      searx/engines/google_news.py
  2. 56
    6
      tests/unit/engines/test_google_news.py

+ 2
- 2
searx/engines/google_news.py View File

67
     for result in dom.xpath('//div[@class="g"]|//div[@class="g _cy"]'):
67
     for result in dom.xpath('//div[@class="g"]|//div[@class="g _cy"]'):
68
         try:
68
         try:
69
             r = {
69
             r = {
70
-                'url': result.xpath('.//div[@class="_cnc"]//a/@href')[0],
71
-                'title': ''.join(result.xpath('.//div[@class="_cnc"]//h3//text()')),
70
+                'url': result.xpath('.//a[@class="l _PMs"]')[0].attrib.get("href"),
71
+                'title': ''.join(result.xpath('.//a[@class="l _PMs"]//text()')),
72
                 'content': ''.join(result.xpath('.//div[@class="st"]//text()')),
72
                 'content': ''.join(result.xpath('.//div[@class="st"]//text()')),
73
             }
73
             }
74
         except:
74
         except:

+ 56
- 6
tests/unit/engines/test_google_news.py
File diff suppressed because it is too large
View File