PHP cURL getting plaintext

10.4k Views Asked by At

I'm using cURL in PHP to get webpage content and it's working great. But my program now requires me to get only plain-text from the target site, instead of HTML. I've looked extensively for this but no one seems to have the answer. I've also tried setting CURLOPT_TRANSFERTEXT=1 in my cURL options, but this seems to have no effect on the results.

Any idea why this is happening?

4

There are 4 best solutions below

2
On

Make sure you have the following:

curl_setopt($curl, CURLOPT_RETURNTRANSFER, true);

The return value of curl_exec should then be the data.

2
On

I think it is what you are looking for:

<?
$address = file_get_contents('http://www.thesite.com/file.html');
echo $address;



$file = file_get_contents('./file.txt', true);

 or 

$file = file_get_contents('./file.txt', FILE_USE_INCLUDE_PATH);

?>
0
On

Actually I do like this to get the content of another page.

<?php
function curl_get_file_contents($URL)
    {
        $c = curl_init();
        curl_setopt($c, CURLOPT_RETURNTRANSFER, 1);
        curl_setopt($c, CURLOPT_URL, $URL);
        $contents = curl_exec($c);
        curl_close($c);

        if ($contents) return $contents;
            else return FALSE;
    }


echo curl_get_file_contents("http://www.php.net");
?>

But it returns to me the whole content.

0
On

Do you mean that you want to strip off all the HTML tags on the page?

Simply use the strip_tags() function.