Looking for C# code to extract the main text of a web page - C# Tutorials - 爱易网


Date: 2014-05-19  Views: 21114

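Looking for C# code to extract the main text of a web page
Does anyone have C# code that extracts the main text content of a web page? Could you post it so I can use it as a reference? Thanks!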

------ Solution --------------------
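using System;
using System.IO;
using System.Net;
using System.Text;

// Targets .NET Framework inside a classic ASP.NET application (System.Web).
// The class name is arbitrary; the original post showed only the two methods.
public static class PageSaver
{
    // Downloads the page at url and saves it to filename, encoded as GB2312.
    // Returns 0 on success and -1 on failure.
    public static int saveHtmlFile(string url, string filename)
    {
        int status = -1;
        string respHTML = string.Empty;
        try
        {
            if (ReadHttp(url, ref respHTML) == "OK")
            {
                // Keep a backup of any existing file before overwriting it.
                if (File.Exists(filename))
                {
                    File.Copy(filename, filename + ".bak", true);
                }
                // The using block closes the writer even if WriteLine throws.
                using (StreamWriter sw = new StreamWriter(filename, false, Encoding.GetEncoding("GB2312")))
                {
                    sw.WriteLine(respHTML);
                }
                status = 0;
            }
            else
            {
                System.Web.HttpContext.Current.Response.Write("Page not found or server error");
            }
        }
        catch (Exception err)
        {
            System.Web.HttpContext.Current.Response.Write(err.Message);
            status = -1;
        }
        return status;
    }

    // Fetches url and stores the response body, decoded as GB2312, in content.
    // Returns the HTTP status name (for example "OK"), or "ERROR" if the request fails.
    public static string ReadHttp(string url, ref string content)
    {
        string status = "ERROR";
        HttpWebRequest Webreq = (HttpWebRequest)WebRequest.Create(url);
        try
        {
            using (HttpWebResponse Webresp = (HttpWebResponse)Webreq.GetResponse())
            using (StreamReader strm = new StreamReader(Webresp.GetResponseStream(), Encoding.GetEncoding("GB2312")))
            {
                content = strm.ReadToEnd();
                // Report success only after the body has been read completely.
                status = Webresp.StatusCode.ToString();
            }
        }
        catch (Exception)
        {
            // GetResponse throws for network failures and for 4xx/5xx responses;
            // in all of those cases status stays "ERROR".
        }
        return status;
    }
}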
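
The code above only downloads the page and saves the raw HTML; it does not yet pull the text out of the markup, which is what the question asks for. One possible follow-up step, which is not part of the original answer, is to strip the tags with regular expressions. The sketch below assumes a hypothetical helper class `HtmlText` with a method `ExtractText`, and it is deliberately crude: it returns all visible text on the page, not just the article body, and regular expressions can be fooled by malformed HTML.

```csharp
using System;
using System.Net;
using System.Text.RegularExpressions;

// Hypothetical helper, not from the original post: a rough tag stripper.
public static class HtmlText
{
    public static string ExtractText(string html)
    {
        if (string.IsNullOrEmpty(html)) return string.Empty;

        // Drop <script> and <style> blocks together with their contents.
        string text = Regex.Replace(html, @"<(script|style)\b[^>]*>.*?</\1\s*>", " ",
            RegexOptions.IgnoreCase | RegexOptions.Singleline);

        // Drop HTML comments.
        text = Regex.Replace(text, @"<!--.*?-->", " ", RegexOptions.Singleline);

        // Drop every remaining tag, leaving a space so adjacent words stay separated.
        text = Regex.Replace(text, @"<[^>]+>", " ");

        // Decode entities such as &amp; and &nbsp;.
        text = WebUtility.HtmlDecode(text);

        // Collapse runs of whitespace into single spaces.
        return Regex.Replace(text, @"\s+", " ").Trim();
    }
}
```

It could be called on the string filled in by `ReadHttp`, for example `string text = HtmlText.ExtractText(respHTML);`. For real main-content extraction (skipping navigation, ads, and footers), a proper HTML parser combined with a content-density heuristic would likely be more reliable than this regex approach.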
